Towards Improving NAM-to-Speech Synthesis Intelligibility using Self-Supervised Speech Models.

Neil Kumar Shah, Shirish Karande, Vineet Gandhi

Published in: CoRR (2024)

Keyphrases

speech synthesis
speech recognition
text to speech
vocal tract
prosodic features
probabilistic model
statistical models
machine learning
computer vision
case study
speech signal
model selection
noisy environments
complex systems
neural network
artificial neural networks
pattern recognition
bayesian networks
data mining