Sign in

Focus and Align: Learning Tube Tokens for Video-Language Pre-Training.

Yongqing ZhuXiangyang LiMao ZhengJiahao YangZihan WangXiaoqian GuoZifeng ChaiYuchen YuanShuqiang Jiang
Published in: IEEE Trans. Multim. (2023)
Keyphrases