Deep Learning Based Multi-modal Addressee Recognition in Visual Scenes with Utterances.

Published in: IJCAI (2018)

Keyphrases