Sign in

TMT: A Transformer-Based Modal Translator for Improving Multimodal Sequence Representations in Audio Visual Scene-Aware Dialog.

Wubo LiDongwei JiangWei ZouXiangang Li
Published in: INTERSPEECH (2020)
Keyphrases