M³IT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning.
Lei Li
Yuwei Yin
Shicheng Li
Liang Chen
Peiyi Wang
Shuhuai Ren
Mukai Li
Yazheng Yang
Jingjing Xu
Xu Sun
Lingpeng Kong
Qi Liu
Published in: CoRR (2023)
Keyphrases
multi modal
cross modal
multi modality
audio visual
semantic concepts
high dimensional
low level
video search
computer vision
multimedia
object recognition