Sign in

Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation.

Feilong ChenFandong MengXiuyi ChenPeng LiJie Zhou
Published in: ACL/IJCNLP (Findings) (2021)
Keyphrases
  • visual information
  • visual features
  • high level
  • multi modal
  • visual analysis
  • machine learning
  • multiscale
  • natural language
  • low level
  • incremental learning
  • visual data
  • visual search