Synthesizing Personalized Non-speech Vocalization from Discrete Speech Representations.
Chin-Cheng HsuPublished in: CoRR (2022)
Keyphrases
- speech recognition
- speech signal
- speaker recognition
- automatic speech recognition
- e learning
- speech synthesis
- text to speech
- spoken language
- language acquisition
- audio visual
- bayesian networks
- data sets
- user profiles
- multi modal
- noisy environments
- hidden markov models
- learning environment
- broadcast news
- speaker identification
- website
- speech processing
- vocal tract
- endpoint detection