From Pixels to Objects: Cubic Visual Attention for Visual Question Answering.
Jingkuan SongPengpeng ZengLianli GaoHeng Tao ShenPublished in: IJCAI (2018)
Keyphrases
- question answering
- visual attention
- visual scene
- visual input
- focus of attention
- visual search
- object based visual attention
- eye tracking
- vision system
- saliency map
- visual saliency
- eye movements
- question classification
- information retrieval
- visual information
- higher level
- natural language processing
- passage retrieval
- salient regions
- qa clef
- syntactic information
- question answering systems
- natural language
- visual motion
- visual data
- information extraction
- computer vision
- spatial relations
- visual features
- input image
- low level features
- image regions
- qa systems
- low level
- object recognition
- high level