Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning.

Xin Wang, Yuan-Fang Wang, William Yang Wang
Published in: NAACL-HLT (2) (2018)