Low-Resource Speech Synthesis with Speaker-Aware Embedding.
Li-Jen YangI-Ping YehJen-Tzung ChienPublished in: ISCSLP (2022)
Keyphrases
- speech synthesis
- speech recognition
- prosodic features
- vocal tract
- text to speech
- automatic speech recognition
- pattern recognition
- hidden markov models
- high levels
- language model
- resource allocation
- vector space
- web resources
- speaker identification
- speech signal
- multi modal
- data mining
- speaker recognition
- data sets
- resource management
- speaker diarization