AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario.
Yihui FuLuyao ChengShubo LvYukai JvYuxiang KongZhuo ChenYanxin HuLei XieJian WuHui BuXin XuJun DuJingdong ChenPublished in: CoRR (2021)
Keyphrases
- speaker diarization
- speech enhancement
- noisy environments
- speech recognition
- speaker verification
- speech signal
- object recognition
- signal to noise ratio
- speaker identification
- noise reduction
- human activities
- vocal tract
- pattern recognition
- neural network
- background noise
- bayesian networks
- smoothing algorithm
- single channel
- broadcast news
- action recognition