HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips.

Published in: CoRR (2019)

Keyphrases