From Images to Textual Prompts: Zero-shot VQA with Frozen Large Language Models.
Jiaxian GuoJunnan LiDongxu LiAnthony Meng Huat TiongBoyang LiDacheng TaoSteven C. H. HoiPublished in: CoRR (2022)
Keyphrases
- language model
- image database
- image data
- language modeling
- n gram
- image retrieval
- test collection
- probabilistic model
- speech recognition
- information retrieval
- image annotation
- relevance model
- image features
- image classification
- document retrieval
- retrieval model
- image understanding
- context sensitive
- language modelling
- image regions
- image collections
- hidden markov models
- statistical language models