Towards Improving NAM-to-Speech Synthesis Intelligibility using Self-Supervised Speech Models.
Neil Kumar ShahShirish KarandeVineet GandhiPublished in: CoRR (2024)
Keyphrases
- speech synthesis
- speech recognition
- text to speech
- vocal tract
- prosodic features
- probabilistic model
- statistical models
- machine learning
- computer vision
- case study
- speech signal
- model selection
- noisy environments
- complex systems
- neural network
- artificial neural networks
- pattern recognition
- bayesian networks
- data mining