GPT-4o: Visual perception performance of multimodal large language models in piglet activity understanding.
Yiqi WuXiaodan HuZiming FuSiling ZhouJiangong LiPublished in: CoRR (2024)
Keyphrases
- language model
- visual perception
- language modeling
- document retrieval
- n gram
- probabilistic model
- language modelling
- query expansion
- visual attention
- retrieval model
- information retrieval
- speech recognition
- mixture model
- ad hoc information retrieval
- statistical language models
- test collection
- multi modal
- query terms
- smoothing methods
- context sensitive
- vector space model
- language models for information retrieval
- statistical language modeling
- language model for information retrieval
- document ranking
- translation model
- relevance model
- text mining
- low level
- high level