Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?
Abhishek Das, Harsh Agrawal, Larry Zitnick, Devi Parikh, Dhruv Batra
Published in: Comput. Vis. Image Underst. (2017)
Keyphrases
- question answering
- visual stimuli
- question classification
- information extraction
- information retrieval
- natural language processing
- QA@CLEF
- syntactic information
- named entities
- natural language questions
- natural language
- visual information
- relation extraction
- visual features
- cross-language
- passage retrieval
- open-domain question answering
- visual attention
- eye movements
- semantic roles
- multimodal
- answering questions
- co-occurrence
- textual entailment recognition
- relevance feedback