Prosody Transfer in Neural Text to Speech Using Global Pitch and Loudness Features.
Siddharth Gururani, Kilol Gupta, Dhaval Shah, Zahra Shakeri, Jervis Pinto
Published in: CoRR (2019)
Keyphrases
- text to speech
- speech synthesis
- prosodic features
- feature extraction
- low level
- co occurrence
- neural network
- programming tool
- feature set
- global features
- associative memory
- continuous wavelet transform
- neural classifier
- extracting features
- network architecture
- image features
- feature vectors
- high level
- image processing