Login / Signup
Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception.
Junwen He
Yifan Wang
Lijun Wang
Huchuan Lu
Jun-Yan He
Jin-Peng Lan
Bin Luo
Xuansong Xie
Published in:
CoRR (2024)
Keyphrases
</>
multi modal
visual perception
fine grained
coarse grained
visual attention
access control
high dimensional
multi modality
audio visual
cross modal
multimedia
real time
computer vision
multiscale
data management