Task-Oriented Multi-Modal Mutual Learning for Vision-Language Models.
Sifan Long
Zhen Zhao
Junkun Yuan
Zichang Tan
Jiangjiang Liu
Luping Zhou
Shengsheng Wang
Jingdong Wang
Published in:
ICCV (2023)
Keyphrases
multi modal
language model
speech recognition
multi modality
n gram
language modeling
document retrieval
high level
test collection
retrieval model
statistical language models
information retrieval
video search
audio visual
query expansion
probabilistic model
image retrieval
high dimensional
multimedia