Generalizable Zero-Shot Speaker Adaptive Speech Synthesis with Disentangled Representations.

Wenbin Wang, Yang Song, Sanjay Jha

Published in: CoRR (2023)

Keyphrases

speech synthesis
speech recognition
prosodic features
vocal tract
text-to-speech
speech signal
speaker verification
speaker identification
automatic speech recognition
speaker dependent
language model
speaker diarization
real time
object categories
noisy environments
adaptive systems
data driven
object detection
hidden Markov models
pattern recognition
data sets