Waveform-Based Speaker Representations for Speech Synthesis.
Moquan WanGilles DegottexMark J. F. GalesPublished in: INTERSPEECH (2018)
Keyphrases
- speech synthesis
- speech recognition
- prosodic features
- vocal tract
- text to speech
- hidden markov models
- automatic speech recognition
- speaker identification
- language model
- speech signal
- speech corpus
- multi modal
- neural network
- symbolic representation
- audio visual
- image coding
- speaker verification
- vector quantization
- speaker diarization
- principal component analysis
- feature extraction
- speaker dependent
- case study