WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition.
Binbin ZhangHang LvPengcheng GuoQijie ShaoChao YangLei XieXin XuHui BuXiaoyu ChenChenchen ZengDi WuZhendong PengPublished in: CoRR (2021)
Keyphrases
- speech recognition
- multi domain
- speaker independent
- cross domain
- language model
- hidden markov models
- speech synthesis
- domain specific
- automatic speech recognition
- speech recognizer
- speech processing
- speech recognition systems
- speech signal
- speech recognition technology
- pattern recognition
- noisy environments
- speaker identification
- heterogeneous networks
- speaker dependent
- spontaneous speech
- conversational speech