Can't make an Omelette without Breaking some Eggs: Plausible Action Anticipation using Large Video-Language Models.
Himangi MittalNakul AgarwalShao-Yuan LoKwonjoon LeePublished in: CoRR (2024)
Keyphrases
- language model
- language modeling
- human actions
- probabilistic model
- document retrieval
- n gram
- language modelling
- video sequences
- statistical language models
- speech recognition
- test collection
- retrieval model
- query expansion
- information retrieval
- video data
- video frames
- context sensitive
- smoothing methods
- ad hoc information retrieval
- action recognition
- multimedia
- document ranking
- language models for information retrieval
- language model for information retrieval
- vector space model
- video content
- query terms
- multi modal
- pseudo relevance feedback
- document length