InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation.

Yi Wang, Yinan He, Yizhuo Li, Kunchang Li, Jiashuo Yu, Xin Ma, Xinyuan Chen, Yaohui Wang, Ping Luo, Ziwei Liu, Yali Wang, Limin Wang, Yu Qiao
Published in: CoRR (2023)
Keyphrases