Streaming end-to-end speech recognition with jointly trained neural feature enhancement.

Chanwoo Kim Abhinav Garg Dhananjaya Gowda Seongkyu Mun Changwoo Han

Published in: CoRR (2021)

Keyphrases

end to end
speech recognition
scalable video
isolated word
hidden markov models
language model
automatic speech recognition
rate adaptation
speech signal
speech synthesis
speech processing
speech recognizer
cepstral coefficients
speaker identification
pattern recognition
noisy environments
neural network
speech recognition technology
network architecture
congestion control
image processing
feature vectors
speech recognition systems
content delivery
computer vision
image quality
information retrieval