MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions.

Mattia Soldan, Alejandro Pardo, Juan León Alcázar, Fabian Caba Heilbron, Chen Zhao, Silvio Giancola, Bernard Ghanem
Published in: CVPR (2022)