ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models.
Mingrui Wu
Xinyue Cai
Jiayi Ji
Jiale Li
Oucheng Huang
Gen Luo
Hao Fei
Xiaoshuai Sun
Rongrong Ji
Published in:
CoRR (2024)
Keyphrases
language model
language modeling
n-gram
speech recognition
probabilistic model
supervised learning
multimodal
visual features
retrieval model
language model for information retrieval
multimedia
vector space model
visual information
document retrieval
query expansion
active learning
image retrieval