Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis.
Shivam MehtaSiyang WangSimon AlexandersonJonas BeskowÉva SzékelyGustav Eje HenterPublished in: SSW (2023)
Keyphrases
- denoising
- image denoising
- multimodal interfaces
- total variation
- gesture recognition
- denoising algorithm
- natural images
- noisy images
- speech recognition
- bayesian networks
- probabilistic model
- hand movements
- wavelet domain
- noise removal
- image processing
- speech synthesis
- program synthesis
- automatic speech recognition
- audio visual
- gaussian noise
- pattern recognition
- hand gestures
- wavelet packet
- hidden markov models
- posterior probability
- dialogue system
- emotion recognition
- probabilistic logic
- hierarchical data
- image sequences
- language model
- image compression
- wavelet denoising
- human computer interaction