Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding.

Published in: CoRR (2016)

Keyphrases