VisionGPT: Vision-Language Understanding Agent Using Generalized Multimodal Framework.

Chris Kelly, Luhui Hu, Bang Yang, Yu Tian, Deshun Yang, Cindy Yang, Zaoshan Huang, Zihao Li, Jiayin Hu, Yuexian Zou
Published in: CoRR (2024)
Keyphrases
  • language understanding
  • multi agent
  • machine learning
  • multi agent systems
  • domain specific
  • language processing
  • artificial intelligence
  • recommender systems
  • intelligent agents
  • natural language understanding