Sign in

LiteVL: Efficient Video-Language Learning with Enhanced Spatial-Temporal Modeling.

Dongsheng ChenChaofan TaoLu HouLifeng ShangXin JiangQun Liu
Published in: CoRR (2022)
Keyphrases