Simulating Realistic Speech Overlaps Improves Multi-Talker ASR.
Muqiao YangNaoyuki KandaXiaofei WangJian WuSunit SivasankaranZhuo ChenJinyu LiTakuya YoshiokaPublished in: ICASSP (2023)
Keyphrases
- automatic speech recognition
- speech recognition
- speech signal
- noisy environments
- spontaneous speech
- speech corpus
- broadcast news
- real life
- spoken words
- word error rate
- speech synthesis
- hidden markov models
- computer vision
- facial animation
- pattern recognition
- acoustic features
- recognition errors
- speech retrieval
- image sequences
- case study
- language acquisition
- image processing
- human machine interaction
- audio visual
- probabilistic model
- genetic algorithm
- information retrieval
- real world
- database