Modality Shifting Attention Network for Multi-Modal Video Question Answering.
Junyeong KimMinuk MaTrung X. PhamKyungsu KimChang D. YooPublished in: CVPR (2020)
Keyphrases
- multi modal
- question answering
- video search
- semantic concepts
- multiple modalities
- natural language processing
- information extraction
- question classification
- qa clef
- syntactic information
- multi modality
- video sequences
- information retrieval
- video streams
- audio visual
- passage retrieval
- cross modal
- single modality
- multimedia
- question answering systems
- video content
- natural language
- visual information
- video data
- candidate answers
- image classification
- natural language questions
- answer validation