(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering.
Anoop Cherian, Chiori Hori, Tim K. Marks, Jonathan Le Roux
Published in: AAAI (2022)
Keyphrases
- question answering
- spatio temporal
- video sequences
- moving objects
- image sequences
- space time
- dynamic textures
- human actions
- video data
- information retrieval
- visual data
- information extraction
- question classification
- natural language processing
- video content
- cross language
- passage retrieval
- named entities
- video streams
- relation extraction
- natural language questions
- natural language
- question answering systems
- sentence retrieval
- qa clef
- syntactic information
- semantic roles
- human motion
- video frames
- video analysis
- multimedia
- answer validation
- textual entailment recognition
- video retrieval
- artificial intelligence
- open domain question answering
- graph structure
- data mining
- key frames
- candidate answers
- action recognition