Monotonic segmental attention for automatic speech recognition.
Albert Zeyer, Robin Schmitt, Wei Zhou, Ralf Schlüter, Hermann Ney
Published in: CoRR (2022)
Keyphrases
- automatic speech recognition
- hidden Markov models
- speech recognition
- conversational speech
- broadcast news
- word error rate
- speech signal
- speech retrieval
- recognition errors
- visual attention
- noisy environments
- word recognition
- spoken words
- speech corpus
- spontaneous speech
- focus of attention
- acoustic features
- neural network
- language model
- multiscale