Generalizable Zero-Shot Speaker Adaptive Speech Synthesis with Disentangled Representations.

Wenbin Wang Yang Song Sanjay Jha

Published in: INTERSPEECH (2023)

Keyphrases

speech synthesis
speech recognition
prosodic features
vocal tract
text to speech
hidden markov models
speaker identification
automatic speech recognition
speaker verification
speech signal
pattern recognition
neural network
noisy environments
adaptive systems
language model
speaker dependent
speech corpus
representation scheme
data driven
similarity measure
data mining