Equivariant and Invariant Grounding for Video Question Answering.
Yicong LiXiang WangJunbin XiaoTat-Seng ChuaPublished in: ACM Multimedia (2022)
Keyphrases
- question answering
- information retrieval
- natural language
- natural language processing
- video data
- cross language
- question classification
- multimedia
- video sequences
- information extraction
- question answering systems
- video content
- passage retrieval
- video frames
- syntactic information
- named entities
- qa clef
- natural language questions
- open domain question answering
- relation extraction
- data mining
- text retrieval
- semantic roles
- qa systems
- answering questions