Sign in

MathVista: Evaluating Math Reasoning in Visual Contexts with GPT-4V, Bard, and Other Large Multimodal Models.

Pan LuHritik BansalTony XiaJiacheng LiuChunyuan LiHannaneh HajishirziHao ChengKai-Wei ChangMichel GalleyJianfeng Gao
Published in: CoRR (2023)
Keyphrases
  • model selection
  • model construction
  • cross modal
  • reasoning processes
  • visual tasks
  • knowledge base
  • hidden markov models
  • multi modal
  • video data
  • complex systems
  • visual data
  • formal models