GeoGPT4V: Towards Geometric Multi-modal Large Language Models with Geometric Image Generation.

Shihao Cai, Keqin Bao, Hangyu Guo, Jizhi Zhang, Jun Song, Bo Zheng
Published in: CoRR (2024)
Keyphrases
  • multi-modal
  • language model
  • image generation
  • language modeling
  • probabilistic model
  • n-gram
  • information retrieval
  • audio visual
  • video search
  • high resolution
  • low level
  • geometric information