Eyes Can Deceive: Benchmarking Counterfactual Reasoning Abilities of Multi-modal Large Language Models.
Yian LiWentao TianYang JiaoJingjing ChenYu-Gang JiangPublished in: CoRR (2024)
Keyphrases
- multi modal
- language model
- language modeling
- n gram
- language modelling
- document retrieval
- probabilistic model
- statistical language models
- speech recognition
- information retrieval
- multi modality
- query expansion
- audio visual
- test collection
- retrieval model
- relevance model
- high dimensional
- smoothing methods
- document ranking
- language models for information retrieval
- pseudo relevance feedback
- video search
- translation model
- image annotation
- cross modal
- co occurrence
- context sensitive
- image representation
- image classification
- video sequences
- feature selection