Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine Translation.
Zhiyong WuLingpeng KongWei BiXiang LiBen KaoPublished in: ACL/IJCNLP (1) (2021)
Keyphrases
- machine translation
- visual context
- temporal context
- object detection
- audio visual
- scene interpretation
- natural language processing
- semantic context
- visual scene
- multi modal
- information extraction
- cross language information retrieval
- statistical machine translation
- target language
- natural language
- visual words
- information retrieval
- spatial context
- computer vision