ViLLa: Video Reasoning Segmentation with Large Language Model.
Rongkun ZhengLu QiXi ChenYi WangKun WangYu QiaoHengshuang ZhaoPublished in: CoRR (2024)
Keyphrases
- language model
- language modeling
- n gram
- speech recognition
- information retrieval
- probabilistic model
- document retrieval
- language modelling
- query expansion
- mixture model
- statistical language models
- multimedia
- context sensitive
- word segmentation
- language model for information retrieval
- ad hoc information retrieval
- video data
- word error rate
- vector space model
- image segmentation
- smoothing methods
- query terms
- translation model
- document ranking
- retrieval model
- language models for information retrieval
- automatic speech recognition
- document length
- dirichlet prior
- statistical language modeling
- word clouds