Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset.
Tiezheng Yu, Rita Frieske, Peng Xu, Samuel Cahyawijaya, Cheuk Tung Shadow Yiu, Holy Lovenia, Wenliang Dai, Elham J. Barezi, Qifeng Chen, Xiaojuan Ma, Bertram E. Shi, Pascale Fung
Published in: CoRR (2022)
Keyphrases
- automatic speech recognition
- speech recognition
- benchmark datasets
- synthetic datasets
- training dataset
- hidden Markov models
- speech signal
- conversational speech
- speech retrieval
- word error rate
- speech corpus
- broadcast news
- spoken words
- word recognition
- noisy environments
- handwriting recognition
- spontaneous speech
- cl sr
- multimodal
- language model
- information retrieval systems
- machine learning