Structured Two-stream Attention Network for Video Question Answering.
Lianli GaoPengpeng ZengJingkuan SongYuan-Fang LiWu LiuTao MeiHeng Tao ShenPublished in: CoRR (2022)
Keyphrases
- question answering
- information retrieval
- information extraction
- video data
- question classification
- named entities
- video sequences
- natural language processing
- passage retrieval
- cross language
- natural language questions
- qa clef
- natural language
- relation extraction
- structured data
- video frames
- semantic roles
- textual entailment recognition
- open domain question answering
- answer extraction
- video content