Emphatic Speech Prosody Prediction with Deep LSTM Networks.
Slava Shechtman, Moran Mordechay
Published in: ICASSP (2018)
Keyphrases
- text to speech
- speech synthesis
- prediction accuracy
- linear prediction
- speech recognition
- audio visual
- multi stream
- prediction model
- prediction algorithm
- network design
- network structure
- neural network ensemble
- prediction error
- network model
- motion estimation
- prosodic features
- data sets
- endpoint detection
- noisy environments
- heterogeneous networks
- network analysis
- recurrent neural networks
- language model
- genetic algorithm
- information retrieval