II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models.
Ziqiang Liu, Feiteng Fang, Xi Feng, Xinrun Du, Chenhao Zhang, Zekun Wang, Yuelin Bai, Qixuan Zhao, Liyang Fan, Chengguang Gan, Hongquan Lin, Jiaming Li, Yuansheng Ni, Haihong Wu, Yaswanth Narsupalli, Zhigang Zheng, Chengming Li, Xiping Hu, Ruifeng Xu, Xiaojun Chen, Min Yang, Jiaheng Liu, Ruibo Liu, Wenhao Huang, Ge Zhang, Shiwen Ni
Published in: CoRR (2024)
Keyphrases
- language model
- image features
- n-gram
- language modeling
- probabilistic model
- image data
- image classification
- document retrieval
- image retrieval
- language modelling
- image segmentation
- smoothing methods
- test collection
- retrieval model
- image regions
- information retrieval
- image content
- context-sensitive
- audio-visual
- multi-modal
- low-level
- statistical language models
- speech recognition
- query terms
- image representation
- vector space model
- language model for information retrieval