Multimodal Speech Synthesis Architecture for Unsupervised Speaker Adaptation.
Hieu-Thi Luong, Junichi Yamagishi
Published in: INTERSPEECH (2018)
Keyphrases
- speech synthesis
- speech recognition
- speaker adaptation
- vocal tract
- speaker independent
- speech recognizer
- speaker dependent
- automatic speech recognition
- prosodic features
- text to speech
- multimodal
- hidden Markov models
- language model
- image processing
- speech signal
- information retrieval
- unsupervised learning
- pattern recognition
- multimedia
- speaker identification