SUR-adapter: Enhancing Text-to-Image Pre-trained Diffusion Models with Large Language Models.
Shanshan Zhong, Zhongzhan Huang, Wushao Wen, Jinghui Qin, Liang Lin
Published in: ACM Multimedia (2023)
Keyphrases
- pre-trained
- language model
- information retrieval
- image segmentation
- language modeling
- probabilistic model
- n-gram
- input image
- image features
- image classification
- diffusion models
- image content
- image retrieval
- query expansion
- retrieval model
- high resolution
- image representation
- image analysis
- speech recognition
- multiscale
- training data
- edge detection
- training examples
- lighting conditions
- target object
- diffusion model
- feature selection