A Comparative Study of the Performance of HMM, DNN, and RNN based Speech Synthesis Systems Trained on Very Large Speaker-Dependent Corpora.
Xin WangShinji TakakiJunichi YamagishiPublished in: SSW (2016)
Keyphrases
- speech recognition
- speech synthesis
- speaker dependent
- hidden markov models
- speaker independent
- nearest neighbor
- vocal tract
- recurrent neural networks
- pattern recognition
- speech signal
- speaker identification
- speech recognizer
- training process
- phoneme recognition
- speech recognition systems
- text to speech
- retrieval systems
- maximum likelihood
- knn
- low level
- training set