Language-Guided Visual Aggregation Network for Video Question Answering.
Xiao LiangDi WangQuan WangBo WanLingling AnLihuo HePublished in: ACM Multimedia (2023)
Keyphrases
- question answering
- natural language
- passage retrieval
- natural language processing
- qa clef
- information extraction
- question answering systems
- named entities
- question classification
- visual data
- cross language
- sentence retrieval
- video data
- information retrieval
- video sequences
- natural language questions
- video search
- visual information
- multimedia
- answer validation
- qa systems
- relation extraction
- visual features
- video content
- relational databases