Spatiotemporal-Textual Co-Attention Network for Video Question Answering.
Zheng-Jun Zha, Jiawei Liu, Tianhao Yang, Yongdong Zhang
Published in: ACM Trans. Multim. Comput. Commun. Appl. (2019)
Keyphrases
- question answering
- natural language
- question classification
- multimedia
- information retrieval
- cross language
- passage retrieval
- natural language processing
- video data
- named entities
- QA@CLEF
- natural language questions
- sentence retrieval
- question answering systems
- open domain question answering
- answer validation
- syntactic information
- relation extraction
- information extraction
- video sequences
- candidate answers
- key frames
- video frames
- answering questions
- artificial intelligence