Login / Signup
MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning.
Haozhe Zhao
Zefan Cai
Shuzheng Si
Xiaojian Ma
Kaikai An
Liang Chen
Zixuan Liu
Sheng Wang
Wenjuan Han
Baobao Chang
Published in:
CoRR (2023)
Keyphrases
</>
multi modal
language model
probabilistic model
context sensitive
audio visual
information retrieval
speech recognition
language modeling
n gram
multi modality
feature extraction
active learning
test collection
translation model