ROME: Evaluating Pre-trained Vision-Language Models on Reasoning beyond Visual Common Sense.
Kankan ZhouEason LaiWei Bin Au YeongKyriakos MouratidisJing JiangPublished in: CoRR (2023)
Keyphrases
- language model
- pre trained
- language modeling
- probabilistic model
- n gram
- statistical language models
- speech recognition
- language modelling
- test collection
- retrieval model
- document retrieval
- smoothing methods
- information retrieval
- query expansion
- computer vision
- relevance model
- low level
- training examples
- neural network
- visual information
- training data
- data sets
- visual features
- real time
- face recognition
- learning algorithm
- image retrieval
- feature extraction
- language models for information retrieval