Sign in

AssistGPT: A General Multi-modal Assistant that can Plan, Execute, Inspect, and Learn.

Difei GaoLei JiLuowei ZhouKevin Qinghong LinJoya ChenZihan FanMike Zheng Shou
Published in: CoRR (2023)
Keyphrases
  • multi modal
  • multi modality
  • audio visual
  • image processing
  • semantic concepts
  • high dimensional
  • image annotation
  • low level
  • image search
  • cross modal
  • multiple modalities