Optimizing Alignment of Speech and Language Latent Spaces for End-to-End Speech Recognition and Understanding.
Wei Wang, Shuo Ren, Yao Qian, Shujie Liu, Yu Shi, Yanmin Qian, Michael Zeng
Published in: CoRR (2021)
Keyphrases
- speech recognition
- end-to-end
- isolated word
- speech synthesis
- automatic speech recognition
- speech signal
- speech processing
- language model
- speech recognizer
- speaker identification
- speech recognition technology
- hidden Markov models
- noisy environments
- pattern recognition
- congestion control
- word error rate
- speaker dependent
- speech recognition systems
- speaker independent
- speaker recognition
- natural language
- machine learning
- information retrieval
- speaker verification
- language processing
- cepstral coefficients
- speech retrieval
- Bayesian networks