Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model.
Zipeng XuTianwei LinHao TangFu LiDongliang HeNicu SebeRadu TimofteLuc Van GoolErrui DingPublished in: CoRR (2021)
Keyphrases
- language model
- image manipulation
- pre trained
- information retrieval
- language modeling
- n gram
- image editing
- speech recognition
- probabilistic model
- computer vision
- query expansion
- training data
- text mining
- image processing
- mixture model
- translation model
- control signals
- image composition
- training examples
- image retrieval