Speech-to-Face Conversion Using Denoising Diffusion Probabilistic Models.
Shuhei KatoTaiichi HashimotoPublished in: INTERSPEECH (2023)
Keyphrases
- denoising
- probabilistic model
- nonlinear diffusion
- diffusion processes
- recognition engine
- image denoising
- graphical models
- speech recognition
- total variation
- natural images
- language model
- denoising algorithm
- noisy images
- anisotropic diffusion
- diffusion process
- audio visual
- generative model
- gaussian noise
- wavelet domain
- hidden variables
- bayesian networks
- translation invariant
- noise removal
- speech signal
- face images
- latent variables
- facial features
- conditional random fields
- recognition algorithm
- facial images
- wavelet packet
- speech synthesis
- facial expressions
- human faces
- facial animation
- face recognition
- spoken language
- video sequences
- diffusion model
- dialogue system
- information extraction
- image processing