CLEVR3D: Compositional Language and Elementary Visual Reasoning for Question Answering in 3D Real-World Scenes.
Xu YanZhihao YuanYuhao DuYinghong LiaoYao GuoZhen LiShuguang CuiPublished in: CoRR (2021)
Keyphrases
- question answering
- real world scenes
- natural language
- complex scenes
- answering questions
- natural language processing
- information extraction
- information retrieval
- knowledge representation
- spatial relationships
- real scenes
- question classification
- qa clef
- syntactic information
- passage retrieval
- cross language
- natural language questions
- visual information
- visual features
- image sequences
- qa systems
- low level
- question answering systems
- high level
- knowledge base
- spatial relations
- action recognition
- candidate answers
- artificial intelligence
- data mining