Login / Signup
LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark.
Zhenfei Yin
Jiong Wang
Jianjian Cao
Zhelun Shi
Dingning Liu
Mukai Li
Lu Sheng
Lei Bai
Xiaoshui Huang
Zhiyong Wang
Jing Shao
Wanli Ouyang
Published in:
CoRR (2023)
Keyphrases
</>
multi modal
cross modal
multi modality
audio visual
multiple modalities
language learning
machine learning
image segmentation
high level
feature extraction