Coarse to Fine Frame Selection for Online Open-ended Video Question Answering.
Sai Vidyaranya Nuthalapati, Anirudh Tunga
Published in: ICCV (Workshops) (2023)
Keyphrases
- coarse to fine
- question answering
- open ended
- successive frames
- multiscale
- multiresolution
- video frames
- object detection
- question classification
- learning outcomes
- natural language processing
- information retrieval
- image registration
- information extraction
- question answering systems
- passage retrieval
- cross language
- video sequences
- key frames
- named entities
- natural language
- syntactic information
- natural language questions
- qa clef
- answering questions
- dynamic programming
- video content
- qa systems
- video data
- computer vision
- candidate answers
- visual features
- multimedia
- answer validation