Cross-Modal Contextualized Diffusion Models for Text-Guided Visual Generation and Editing.
Ling Yang, Zhilong Zhang, Zhaochen Yu, Jingwei Liu, Minkai Xu, Stefano Ermon, Bin Cui
Published in: CoRR (2024)
Keyphrases
- cross modal
- diffusion models
- multi modal
- diffusion model
- multimedia retrieval
- image retrieval
- information diffusion
- perceptual information
- text retrieval
- visual similarity
- information retrieval
- visual data
- visual recognition
- text mining
- social networks
- multimedia databases
- web images
- object recognition
- viral marketing
- influence maximization
- keywords
- multimedia