Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset.
Tiezheng Yu, Rita Frieske, Peng Xu, Samuel Cahyawijaya, Cheuk Tung Shadow Yiu, Holy Lovenia, Wenliang Dai, Elham J. Barezi, Qifeng Chen, Xiaojuan Ma, Bertram E. Shi, Pascale Fung
Published in: LREC (2022)
Keyphrases
- automatic speech recognition
- speech recognition
- benchmark datasets
- synthetic datasets
- training dataset
- speech signal
- word error rate
- hidden Markov models
- speech retrieval
- conversational speech
- broadcast news
- noisy environments
- spoken words
- speech corpus
- recognition errors
- word recognition
- document retrieval
- error rate
- natural images
- information retrieval systems