Waveform-Based Speaker Representations for Speech Synthesis.

Moquan Wan Gilles Degottex Mark J. F. Gales

Published in: INTERSPEECH (2018)

Keyphrases

speech synthesis
speech recognition
prosodic features
vocal tract
text to speech
hidden markov models
automatic speech recognition
speaker identification
language model
speech signal
speech corpus
multi modal
neural network
symbolic representation
audio visual
image coding
speaker verification
vector quantization
speaker diarization
principal component analysis
feature extraction
speaker dependent
case study