Sign in

ShareGPT4V: Improving Large Multi-Modal Models with Better Captions.

Lin ChenJinsong LiXiaoyi DongPan ZhangConghui HeJiaqi WangFeng ZhaoDahua Lin
Published in: CoRR (2023)
Keyphrases
  • multi modal
  • audio visual
  • high dimensional
  • cross modal
  • multi modality
  • semantic concepts
  • metadata
  • medical images
  • statistical analysis
  • video search
  • uni modal