Towards Multi-Scale Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis.
Shun LeiYixuan ZhouLiyang ChenJiankun HuZhiyong WuShiyin KangHelen MengPublished in: INTERSPEECH (2022)
Keyphrases
- speech synthesis
- speech recognition
- prosodic features
- multiscale
- coarse to fine
- text to speech
- vocal tract
- language model
- automatic speech recognition
- hidden markov models
- edge detection
- scale space
- speaker independent
- image processing
- wavelet transform
- conversational agents
- hierarchical classification
- hierarchical clustering
- natural images
- pattern recognition
- image segmentation
- speech corpus
- speaker identification
- real time
- broadcast news
- hierarchical model
- multiple scales
- local binary pattern
- image compression
- information retrieval
- machine learning
- neural network