Login / Signup
Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis.
Yixuan Zhou
Changhe Song
Xiang Li
Luwen Zhang
Zhiyong Wu
Yanyao Bian
Dan Su
Helen Meng
Published in:
INTERSPEECH (2022)
Keyphrases
</>
fine grained
speaker adaptation
speech recognition
text to speech synthesis
coarse grained
speaker dependent
maximum likelihood
automatic speech recognition
access control
user intent
metadata
speech recognizer
search engine
text to speech
multimedia
feature extraction