(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering.
Anoop CherianChiori HoriTim K. MarksJonathan Le RouxPublished in: CoRR (2022)
Keyphrases
- question answering
- spatio temporal
- video sequences
- moving objects
- image sequences
- dynamic textures
- space time
- human actions
- question classification
- natural language processing
- visual data
- video data
- natural language
- passage retrieval
- information retrieval
- cross language
- syntactic information
- video frames
- qa clef
- multimedia
- named entities
- question answering systems
- video database
- relation extraction
- natural language questions
- information extraction
- sentence retrieval
- human motion
- video content
- answer validation
- open domain question answering
- video retrieval
- qa systems
- video shots
- video analysis
- video streams
- artificial intelligence
- semantic roles
- answer extraction
- action recognition