Zero-Shot Multi-Speaker Text-To-Speech with State-Of-The-Art Neural Speaker Embeddings.
Erica CooperCheng-I LaiYusuke YasudaFuming FangXin WangNanxin ChenJunichi YamagishiPublished in: ICASSP (2020)
Keyphrases
- prosodic features
- text to speech
- speech synthesis
- speaker verification
- speaker recognition
- speech recognition
- audio visual
- neural network
- programming tool
- text to speech synthesis
- network architecture
- automatic speech recognition
- bio inspired
- word processing
- low dimensional
- software engineering
- vector space
- speaker diarization
- multi modal
- speaker dependent