SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models.
Mingze XuMingfei GaoZhe GanHong-You ChenZhengfeng LaiHaiming GangKai KangAfshin DehghanPublished in: CoRR (2024)
Keyphrases
- language model
- language modeling
- n gram
- language modelling
- document retrieval
- trec test collections
- speech recognition
- information retrieval
- probabilistic model
- multimedia
- retrieval model
- statistical language models
- smoothing methods
- video data
- word error rate
- okapi bm
- test collection
- video content
- context sensitive
- query expansion
- language model for information retrieval
- vector space model
- video frames
- language models for information retrieval
- video sequences
- ad hoc information retrieval
- training set
- query terms
- translation model
- pseudo relevance feedback
- cross lingual
- machine learning
- trec collections
- document ranking
- knn
- relevance model
- text retrieval