Monotonic Gaussian regularization of attention for robust automatic speech recognition.
Ye-Qian DuMing-Hui WuXin FangZhou-Wang YangPublished in: Comput. Speech Lang. (2023)
Keyphrases
- automatic speech recognition
- speech recognition
- speech signal
- noisy environments
- word error rate
- hidden markov models
- conversational speech
- broadcast news
- speech retrieval
- speech corpus
- regularization parameter
- video sequences
- recognition errors
- speech recognizer
- acoustic features
- word recognition
- maximum likelihood
- information retrieval