GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio.
Guoguo Chen, Shuzhou Chai, Guanbo Wang, Jiayu Du, Wei-Qiang Zhang, Chao Weng, Dan Su, Daniel Povey, Jan Trmal, Junbo Zhang, Mingjie Jin, Sanjeev Khudanpur, Shinji Watanabe, Shuaijiang Zhao, Wei Zou, Xiangang Li, Xuchen Yao, Yongqing Wang, Yujun Wang, Zhao You, Zhiyong Yan
Published in: CoRR (2021)
Keyphrases
- multi domain
- spontaneous speech
- human machine interaction
- automatic speech recognition
- spoken language
- cross domain
- conversational speech
- domain specific
- search computing
- spoken document retrieval
- spoken dialogue systems
- broadcast news
- multimedia
- gesture recognition
- heterogeneous networks
- rbac model
- linguistic features
- speech recognition
- visual information
- feature selection
- role based access control
- neural network
- unsupervised learning
- co occurrence
- text mining