WENETSPEECH: A 10000+ Hours Multi-Domain Mandarin Corpus for Speech Recognition.
Binbin Zhang, Hang Lv, Pengcheng Guo, Qijie Shao, Chao Yang, Lei Xie, Xin Xu, Hui Bu, Xiaoyu Chen, Chenchen Zeng, Di Wu, Zhendong Peng
Published in: ICASSP (2022)
Keyphrases
- speech recognition
- multi domain
- hidden markov models
- speaker independent
- language model
- cross domain
- domain specific
- speech processing
- speech signal
- automatic speech recognition
- speech synthesis
- speech recognizer
- speech recognition technology
- speaker identification
- noisy environments
- conversational speech
- pattern recognition
- general purpose
- spontaneous speech
- speech recognition systems
- machine learning
- broadcast news
- data mining
- link prediction
- active learning
- speaker adaptation
- speaker dependent