Generalizable Zero-Shot Speaker Adaptive Speech Synthesis with Disentangled Representations.
Wenbin WangYang SongSanjay JhaPublished in: INTERSPEECH (2023)
Keyphrases
- speech synthesis
- speech recognition
- prosodic features
- vocal tract
- text to speech
- hidden markov models
- speaker identification
- automatic speech recognition
- speaker verification
- speech signal
- pattern recognition
- neural network
- noisy environments
- adaptive systems
- language model
- speaker dependent
- speech corpus
- representation scheme
- data driven
- similarity measure
- data mining