Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study.
Xuankai Chang, Brian Yan, Kwanghee Choi, Jee-Weon Jung, Yichen Lu, Soumi Maiti, Roshan S. Sharma, Jiatong Shi, Jinchuan Tian, Shinji Watanabe, Yuya Fujita, Takashi Maekaku, Pengcheng Guo, Yao-Fei Cheng, Pavel Denisov, Kohei Saijo, Hsiu-Hsuan Wang
Published in: CoRR (2023)
Keyphrases
- speech recognition
- speech signal
- automatic speech recognition
- speech synthesis
- speech recognizer
- hidden markov models
- language model
- pattern recognition
- speech processing
- speech recognition technology
- noisy environments
- speech recognition systems
- speech recognizers
- machine translation
- speaker independent
- isolated word
- word error rate
- keyword spotting
- speaker identification
- handwriting recognition
- speech recognition errors
- machine learning
- speaker diarization
- recognition engine
- cepstral coefficients
- speaker dependent
- speech retrieval
- cross language information retrieval
- noisy speech
- information retrieval