Login / Signup
MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning.
Haozhe Zhao
Zefan Cai
Shuzheng Si
Xiaojian Ma
Kaikai An
Liang Chen
Zixuan Liu
Sheng Wang
Wenjuan Han
Baobao Chang
Published in:
ICLR (2024)
Keyphrases
</>
multi modal
language model
context sensitive
language modeling
information retrieval
probabilistic model
document retrieval
cross modal
n gram
query expansion
speech recognition
multi modality
machine learning
low level
image annotation