Understanding Video Scenes through Text: Insights from Text-based Video Question Answering.
Soumya JahagirdarMinesh MathewDimosthenis KaratzasC. V. JawaharPublished in: ICCV (Workshops) (2023)
Keyphrases
- question answering
- video scene
- video data
- scene analysis
- news video
- syntactic information
- video analysis
- information retrieval
- video sequences
- natural language
- video shots
- video frames
- video clips
- information extraction
- multimedia
- natural language processing
- named entities
- dynamic scenes
- audio visual
- moving objects
- passage retrieval
- semantic information
- video streams
- question answering systems
- text mining
- video database
- question answer pairs
- video search
- video retrieval
- video content
- visual features
- multiple features