Login / Signup
WorldGPT: Empowering LLM as Multimodal World Model.
Zhiqi Ge
Hongzhe Huang
Mingze Zhou
Juncheng Li
Guoming Wang
Siliang Tang
Yueting Zhuang
Published in:
CoRR (2024)
Keyphrases
</>
world model
vision system
semantic interpretation
multi modal
multimodal data
multimodal interaction
multimedia
brain image analysis
general purpose
image analysis
intraoperative
audio visual
training data
information systems
computer vision
genetic algorithm
multimodal information
data sets