Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora.
Hieu-Thi LuongXin WangJunichi YamagishiNobuyuki NishizawaPublished in: CoRR (2019)
Keyphrases
- text to speech
- prosodic features
- speech synthesis
- speech recognition
- speaker verification
- automatic speech recognition systems
- speaker recognition
- vocal tract
- automatic speech recognition
- text to speech synthesis
- audio visual
- automatic transcription
- neural network
- speaker diarization
- programming tool
- speaker identification
- acoustic models
- language model
- online learning
- spontaneous speech
- word processing
- feature vectors
- speech signal
- english text
- speech recognition systems
- neural network model