Machine Vision Therapy: Multimodal Large Language Models Can Enhance Visual Robustness via Denoising In-Context Learning.
Zhuo HuangChang LiuYinpeng DongHang SuShibao ZhengTongliang LiuPublished in: CoRR (2023)
Keyphrases
- machine vision
- language model
- denoising
- context sensitive
- n gram
- image processing
- language modeling
- character recognition
- vision system
- speech recognition
- multi modal
- active learning
- statistical language models
- surface inspection
- visual information
- retrieval model
- information retrieval
- probabilistic model
- language model for information retrieval