BSTC: A Large-Scale Chinese-English Speech Translation Dataset.
Ruiqing ZhangXiyang WangChuanqiang ZhangZhongjun HeHua WuZhi LiHaifeng WangYing ChenQinfei LiPublished in: CoRR (2021)
Keyphrases
- text collections
- chinese english
- linguistic resources
- information retrieval
- cross language retrieval
- cross lingual information retrieval
- machine translation
- speech recognition
- cross language information retrieval
- information extraction
- speech signal
- statistical model
- automatic speech recognition
- statistical machine translation
- machine translation system
- broadcast news
- search engine
- artificial intelligence