MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation.

Ludan Ruan, Yiyang Ma, Huan Yang, Huiguo He, Bei Liu, Jianlong Fu, Nicholas Jing Yuan, Qin Jin, Baining Guo
Published in: CoRR (2022)
Keyphrases
  • multi modal
  • diffusion models
  • cross modal
  • audio visual
  • semantic concepts
  • multimedia
  • video search
  • diffusion model
  • information diffusion
  • video sequences
  • information processing
  • video frames
  • visual data