Prompting Visual-Language Models for Efficient Video Understanding.
Chen JuTengda HanKunhao ZhengYa ZhangWeidi XiePublished in: ECCV (35) (2022)
Keyphrases
- language model
- language modeling
- n gram
- probabilistic model
- document retrieval
- speech recognition
- information retrieval
- video sequences
- retrieval model
- language modelling
- relevance model
- multimedia
- statistical language models
- video content
- query expansion
- test collection
- vector space model
- visual information
- query terms
- language models for information retrieval
- key frames
- video retrieval
- error rate
- video data
- multi modal