Self-Chained Image-Language Model for Video Localization and Question Answering.
Shoubin Yu, Jaemin Cho, Prateek Yadav, Mohit Bansal
Published in: CoRR (2023)
Keyphrases
- question answering
- language model
- passage retrieval
- information retrieval
- sentence retrieval
- language modeling
- n-gram
- information extraction
- document retrieval
- image content
- natural language
- image retrieval
- named entities
- image classification
- probabilistic model
- visual data
- retrieval model
- natural language processing
- speech recognition
- query terms
- key frames
- hidden Markov models
- machine learning
- multimedia