VisionGPT: Vision-Language Understanding Agent Using Generalized Multimodal Framework.
Chris Kelly
Luhui Hu
Bang Yang
Yu Tian
Deshun Yang
Cindy Yang
Zaoshan Huang
Zihao Li
Jiayin Hu
Yuexian Zou
Published in:
CoRR (2024)
Keyphrases
language understanding
multi agent
machine learning
multi agent systems
domain specific
language processing
artificial intelligence
recommender systems
intelligent agents
natural language understanding