MIBench: Evaluating Multimodal Large Language Models over Multiple Images.
Haowei LiuXi ZhangHaiyang XuYaya ShiChaoya JiangMing YanJi ZhangFei HuangChunfeng YuanBing LiWeiming HuPublished in: CoRR (2024)
Keyphrases
- multiple images
- language model
- language modeling
- single image
- n gram
- multiple views
- light source
- information retrieval
- language modelling
- probabilistic model
- document retrieval
- retrieval model
- multiple view geometry
- speech recognition
- statistical language models
- context sensitive
- ad hoc information retrieval
- test collection
- man made structures
- vector space model
- smoothing methods
- query expansion
- image processing
- multi modal
- translation model
- language models for information retrieval
- machine learning
- object shapes
- pseudo relevance feedback
- retrieval effectiveness