NPHardEval4V: A Dynamic Reasoning Benchmark of Multimodal Large Language Models.
Lizhou Fan, Wenyue Hua, Xiang Li, Kaijie Zhu, Mingyu Jin, Lingyao Li, Haoyang Ling, Jinkui Chi, Jindong Wang, Xin Ma, Yongfeng Zhang
Published in: CoRR (2024)
Keyphrases
- language model
- language modeling
- n gram
- document retrieval
- probabilistic model
- information retrieval
- speech recognition
- language modelling
- ad hoc information retrieval
- smoothing methods
- test collection
- retrieval model
- context sensitive
- query terms
- query expansion
- relevance model
- statistical language models
- vector space model
- translation model
- multi modal
- language models for information retrieval
- cross lingual
- multimedia
- language model for information retrieval
- machine learning
- document ranking
- search engine