Constructing Multilingual Visual-Text Datasets Revealing Visual Multilingual Ability of Vision Language Models.
Jesse AtuhurraIqra AliTatsuya HiraokaHidetaka KamigaitoTomoya IwakuraTaro WatanabePublished in: CoRR (2024)
Keyphrases
- language model
- language modeling
- probabilistic model
- information retrieval
- visual information
- visual features
- language independent
- cross lingual
- bayesian networks
- statistical language models
- document level
- text retrieval
- speech recognition
- digital libraries
- retrieval model
- test collection
- cross language
- video search
- n gram
- language modelling