Multimodal Graph Networks for Compositional Generalization in Visual Question Answering.
Raeid SaqurKarthik NarasimhanPublished in: NeurIPS (2020)
Keyphrases
- question answering
- information extraction
- natural language processing
- question classification
- information retrieval
- qa clef
- visual information
- natural language
- question answering systems
- graph structure
- cross language
- natural language questions
- passage retrieval
- visual features
- named entities
- relation extraction
- sentence retrieval
- structured data
- syntactic information
- qa systems
- candidate answers
- multi modal
- answer validation
- textual entailment recognition
- open domain question answering
- machine learning
- low level