Generating Natural Language Explanations for Visual Question Answering using Scene Graphs and Visual Attention.
Shalini Ghosh, Giedrius Burachas, Arijit Ray, Avi Ziskind
Published in: CoRR (2019)
Keyphrases
- question answering
- visual attention
- visual scene
- natural language
- visual input
- saliency model
- visual search
- saliency map
- visual motion
- eye movements
- vision system
- object based visual attention
- natural language processing
- visual saliency
- eye tracking
- information extraction
- natural language questions
- visual information
- information retrieval
- question answering systems
- higher level
- passage retrieval
- salient regions
- semantic analysis
- question classification
- knowledge representation
- visual data
- qa clef
- computer vision
- image regions
- human computer interaction