VLAB: Enhancing Video Language Pre-training by Feature Adapting and Blending.

Published in: CoRR (2023)

Keyphrases