Linguistic-Enhanced Transformer with CTC Embedding for Speech Recognition.
Xulong Zhang, Jianzong Wang, Ning Cheng, Mengyuan Zhao, Zhiyong Zhang, Jing Xiao
Published in: CoRR (2022)
Keyphrases
- speech recognition
- hidden Markov models
- language model
- speech processing
- speech recognition technology
- pattern recognition
- automatic speech recognition
- speech recognizer
- speech understanding
- keyword spotting
- noisy environments
- speaker identification
- speech synthesis
- machine learning
- natural language
- speech recognizers
- speech signal
- speaker independent
- cepstral coefficients
- speaker dependent
- feature selection
- information retrieval