Leveraging Topics and Audio Features with Multimodal Attention for Audio Visual Scene-Aware Dialog.

Published in: ViGIL@NeurIPS (2019)

Keyphrases