Prompting Visual-Language Models for Efficient Video Understanding.
Chen Ju, Tengda Han, Kunhao Zheng, Ya Zhang, Weidi Xie
Published in: CoRR (2021)
Keyphrases
- language model
- language modeling
- speech recognition
- probabilistic model
- document retrieval
- n-gram
- language modelling
- video sequences
- statistical language models
- visual data
- video data
- retrieval model
- video frames
- multimedia
- visual information
- query expansion
- multi-modal
- information retrieval
- ad hoc information retrieval
- language models for information retrieval
- video retrieval
- visual features
- vector space model
- pseudo relevance feedback
- video search