VQA-GNN: Reasoning with Multimodal Knowledge via Graph Neural Networks for Visual Question Answering.
Yanan WangMichihiro YasunagaHongyu RenShinya WadaJure LeskovecPublished in: ICCV (2023)
Keyphrases
- question answering
- knowledge representation
- knowledge base
- information retrieval
- syntactic information
- cross language
- information extraction
- natural language processing
- named entities
- natural language
- answering questions
- domain knowledge
- question answering systems
- open domain question answering
- sentence retrieval
- relation extraction
- visual information
- visual features
- passage retrieval
- image database
- question classification
- natural language questions
- multi modal
- structured data
- data mining
- answer extraction
- qa clef
- answer validation