Describe-then-Reason: Improving Multimodal Mathematical Reasoning through Visual Comprehension Training.
Mengzhao JiaZhihan ZhangWenhao YuFangkai JiaoMeng JiangPublished in: CoRR (2024)
Keyphrases
- human reasoning
- multimodal information
- cross modal
- visual information
- training phase
- concept mapping
- multi modal
- visual perception
- training process
- reasoning systems
- visual cues
- training set
- machine learning
- audio visual
- mathematical models
- temporal reasoning
- test set
- reasoning process
- online learning
- low level
- object recognition
- artificial intelligence