Streaming end-to-end speech recognition with jointly trained neural feature enhancement.
Chanwoo KimAbhinav GargDhananjaya GowdaSeongkyu MunChangwoo HanPublished in: CoRR (2021)
Keyphrases
- end to end
- speech recognition
- scalable video
- isolated word
- hidden markov models
- language model
- automatic speech recognition
- rate adaptation
- speech signal
- speech synthesis
- speech processing
- speech recognizer
- cepstral coefficients
- speaker identification
- pattern recognition
- noisy environments
- neural network
- speech recognition technology
- network architecture
- congestion control
- image processing
- feature vectors
- speech recognition systems
- content delivery
- computer vision
- image quality
- information retrieval