STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech.
Keon LeeKyumin ParkDaeyoung KimPublished in: Interspeech (2021)
Keyphrases
- text to speech
- speech synthesis
- prosodic features
- text to speech synthesis
- programming tool
- word processing
- multimodal interaction
- english text
- network architecture
- neural network
- decomposition methods
- writing skills
- decomposition algorithm
- neural model
- decomposition method
- bio inspired
- associative memory
- computational efficiency
- speech recognition
- multiscale