Generalizable Zero-Shot Speaker Adaptive Speech Synthesis with Disentangled Representations.
Wenbin WangYang SongSanjay JhaPublished in: CoRR (2023)
Keyphrases
- speech synthesis
- speech recognition
- prosodic features
- vocal tract
- text to speech
- speech signal
- speaker verification
- speaker identification
- automatic speech recognition
- speaker dependent
- language model
- speaker diarization
- real time
- object categories
- noisy environments
- adaptive systems
- data driven
- object detection
- hidden markov models
- pattern recognition
- data sets