VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks.
Jiannan WuMuyan ZhongSen XingZeqiang LaiZhaoyang LiuWenhai WangZhe ChenXizhou ZhuLewei LuTong LuPing LuoYu QiaoJifeng DaiPublished in: CoRR (2024)
Keyphrases
- end to end
- language model
- language modeling
- n gram
- probabilistic model
- document retrieval
- information retrieval
- statistical language models
- mixture model
- speech recognition
- congestion control
- test collection
- query expansion
- natural language
- ad hoc information retrieval
- real time
- query terms
- language modelling
- retrieval model
- context sensitive
- computer vision
- pseudo relevance feedback
- smoothing methods
- context dependent
- target language
- co occurrence
- cross language retrieval
- document length
- bayesian networks
- language models for information retrieval