Video Question Answering Using Clip-Guided Visual-Text Attention.
Shuhong YeWeikai KongChenglin YaoJianfeng RenXudong JiangPublished in: ICIP (2023)
Keyphrases
- question answering
- video clips
- video search
- information retrieval
- syntactic information
- news video
- text summarization
- video content
- video data
- textual entailment recognition
- natural language processing
- key frames
- question classification
- video streams
- natural language
- visual data
- information extraction
- video sequences
- text retrieval
- multimedia
- question answer pairs
- relation extraction
- video retrieval
- passage retrieval
- free text
- visual features
- video frames
- qa clef
- question answering systems
- text mining
- named entities
- visual information
- natural language questions
- multi modal
- answer extraction
- cross language
- video shots
- semantic information
- candidate answers
- word sense disambiguation
- text documents
- test collection
- answer validation
- co occurrence
- artificial intelligence