InterFormer: Interactive Local and Global Features Fusion for Automatic Speech Recognition.
Zhi-Hao LaiTian-Hao ZhangQi LiuXinyuan QianLi-Fang WeiSong-Lu ChenFeng ChenXu-Cheng YinPublished in: CoRR (2023)
Keyphrases
- automatic speech recognition
- global features
- speech recognition
- visual features
- keypoints
- speech signal
- hidden markov models
- word error rate
- feature vectors
- conversational speech
- image features
- spoken words
- speech retrieval
- broadcast news
- acoustic features
- noisy environments
- recognition errors
- speech corpus
- word recognition
- classification method
- spontaneous speech