Low-Resource Mongolian Speech Synthesis Based on Automatic Prosody Annotation.
Xin Yuan, Robin Feng, Mingming Ye
Published in: CoRR (2022)
Keyphrases
- speech synthesis
- speech recognition
- text to speech
- manual annotation
- automatic annotation
- vocal tract
- prosodic features
- automatic indexing
- fully automatic
- labor intensive
- semantic annotation
- data driven
- annotation tool
- image annotation
- hand crafted
- semi automatically
- speech corpus
- high levels
- automatic image annotation
- database
- semi automatic
- hidden Markov models
- active learning
- speech signal
- peer to peer
- image retrieval
- metadata