Enhancing CTC-based speech recognition with diverse modeling units.
Shiyi HanZhihong LeiMingbin XuXingyu NaZhen HuangPublished in: CoRR (2024)
Keyphrases
- speech recognition
- hidden markov models
- language model
- speech processing
- automatic speech recognition
- speech synthesis
- speech recognizer
- speech recognition technology
- speech signal
- handwriting recognition
- pattern recognition
- speech understanding
- speaker identification
- speech recognition systems
- keyword spotting
- noisy environments
- speaker independent
- speech retrieval
- isolated word
- speaker recognition
- speaker dependent
- speech recognition errors
- computer vision
- speech recognizers
- audio visual speech recognition