On The Usefulness of Self-Attention for Automatic Speech Recognition with Transformers.
Shucong Zhang, Erfan Loweimi, Peter Bell, Steve Renals
Published in: SLT (2021)
Keyphrases
- automatic speech recognition
- speech recognition
- speech signal
- word error rate
- hidden Markov models
- conversational speech
- noisy environments
- speech retrieval
- broadcast news
- recognition errors
- word recognition
- spontaneous speech
- spoken words
- neural network
- computer vision
- acoustic features
- visual attention
- information retrieval
- speech sounds