DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception.
Run LuoYunshui LiLongze ChenWanwei HeTing-En LinZiqiang LiuLei ZhangZikai SongXiaobo XiaTongliang LiuMin YangBinyuan HuiPublished in: CoRR (2024)
Keyphrases
- diffusion model
- diffusion models
- language model
- language modeling
- input image
- image data
- image content
- document retrieval
- multiscale
- probabilistic model
- image segmentation
- image retrieval
- n gram
- edge detection
- image analysis
- smoothing methods
- document ranking
- speech recognition
- query expansion
- objective function
- image processing
- video sequences