Login / Signup
Zero-Shot Multi-Speaker Text-To-Speech with State-Of-The-Art Neural Speaker Embeddings.
Erica Cooper
Cheng-I Lai
Yusuke Yasuda
Fuming Fang
Xin Wang
Nanxin Chen
Junichi Yamagishi
Published in:
ICASSP (2020)
Keyphrases
</>
prosodic features
text to speech
speech synthesis
speaker verification
speaker recognition
speech recognition
audio visual
neural network
programming tool
text to speech synthesis
network architecture
automatic speech recognition
bio inspired
word processing
low dimensional
software engineering
vector space
speaker diarization
multi modal
speaker dependent