Login / Signup
Auto-Encoding Morph-Tokens for Multimodal LLM.
Kaihang Pan
Siliang Tang
Juncheng Li
Zhaoyu Fan
Wei Chow
Shuicheng Yan
Tat-Seng Chua
Yueting Zhuang
Hanwang Zhang
Published in:
CoRR (2024)
Keyphrases
</>
multi modal
multimodal function optimization
data mining
brain image analysis
multimodal interaction
audio visual
databases
multi agent
vector quantization
line segments
three dimensional
encoding scheme
image segmentation
music retrieval
multiple modalities
multimodal information
multimedia