Multimodal Speech Synthesis Architecture for Unsupervised Speaker Adaptation.

Hieu-Thi Luong, Junichi Yamagishi

Published in: INTERSPEECH (2018)

Keyphrases

speech synthesis
speech recognition
speaker adaptation
vocal tract
speaker independent
speech recognizer
speaker dependent
automatic speech recognition
prosodic features
text-to-speech
multimodal
hidden Markov models
language model
image processing
speech signal
information retrieval
unsupervised learning
pattern recognition
multimedia
speaker identification