A Neural Vocoder With Hierarchical Generation of Amplitude and Phase Spectra for Statistical Parametric Speech Synthesis.
Yang AiZhen-Hua LingPublished in: IEEE ACM Trans. Audio Speech Lang. Process. (2020)
Keyphrases
- speech synthesis
- speech recognition
- text to speech
- phase information
- prosodic features
- network architecture
- vocal tract
- kernel density estimators
- generation process
- neural network
- statistical analysis
- data driven
- hierarchical clustering
- associative memory
- information theoretic
- statistical inference
- biologically inspired
- automatic speech recognition
- parametric models
- wavelet domain
- hierarchical structure
- principal component analysis
- learning algorithm