GigaSpeech: An Evolving, Multi-Domain ASR Corpus with 10,000 Hours of Transcribed Audio.
Guoguo Chen, Shuzhou Chai, Guan-Bo Wang, Jiayu Du, Wei-Qiang Zhang, Chao Weng, Dan Su, Daniel Povey, Jan Trmal, Junbo Zhang, Mingjie Jin, Sanjeev Khudanpur, Shinji Watanabe, Shuaijiang Zhao, Wei Zou, Xiangang Li, Xuchen Yao, Yongqing Wang, Zhao You, Zhiyong Yan
Published in: Interspeech (2021)
Keyphrases
- multi domain
- spontaneous speech
- automatic speech recognition
- human machine interaction
- spoken language
- cross domain
- conversational speech
- domain specific
- broadcast news
- spoken document retrieval
- spoken dialogue systems
- search computing
- speech recognition
- gesture recognition
- hidden markov models
- multimedia
- role based access control
- k nearest neighbor
- speech signal
- feature set
- nearest neighbor
- video material
- machine learning