VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models.
Zhen XingQi DaiZihao ZhangHui ZhangHan HuZuxuan WuYu-Gang JiangPublished in: CoRR (2023)
Keyphrases
- multi modal
- diffusion models
- video search
- diffusion model
- information diffusion
- semantic concepts
- social networks
- video sequences
- multi modality
- video analysis
- cross modal
- audio visual
- influence maximization
- video frames
- high dimensional
- video content
- viral marketing
- image enhancement
- video data
- dynamic programming
- uni modal