Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy.
Simon GingMaría Alejandra BravoThomas BroxPublished in: CoRR (2024)
Keyphrases
- language model
- open ended
- language modeling
- speech recognition
- document retrieval
- probabilistic model
- n gram
- test collection
- information retrieval
- classification accuracy
- language modelling
- query expansion
- decision trees
- statistical language models
- learning outcomes
- image database
- image classification
- feature vectors
- retrieval model
- context sensitive
- translation model
- language models for information retrieval
- vector space model
- hidden markov models
- document ranking
- smoothing methods
- learning process
- feature space
- ad hoc information retrieval
- statistical language modeling
- feature selection