Improved Prosodic Clustering for Multispeaker and Speaker-independent Phoneme-level Prosody Control.
Myrsini ChristidouAlexandra VioniNikolaos EllinasGeorgios VamvoukakisKonstantinos MarkopoulosPanos KakoulidisJune Sig SungHyoungmin ParkAimilios ChalamandarisPirros TsiakoulisPublished in: CoRR (2021)
Keyphrases
- speech recognition
- speech synthesis
- speaker independent
- speaker dependent
- prosodic features
- hidden markov models
- speech recognizer
- clustering algorithm
- automatic speech recognition
- noisy environments
- pattern recognition
- speech recognition systems
- speech signal
- language model
- text to speech
- vocal tract
- speaker identification
- k means
- speaker diarization
- visual features
- unsupervised learning
- synthesized speech