Streaming end-to-end multi-talker speech recognition.
Liang LuNaoyuki KandaJinyu LiYifan GongPublished in: CoRR (2020)
Keyphrases
- end to end
- speech recognition
- scalable video
- language model
- automatic speech recognition
- hidden markov models
- speech processing
- rate adaptation
- speech synthesis
- speech recognition systems
- noisy environments
- speech recognition technology
- speech signal
- speech recognizer
- congestion control
- application layer
- pattern recognition
- speaker independent
- speaker identification
- content delivery
- speaker dependent
- speech recognizers
- data streams
- multimedia
- audio visual speech recognition
- isolated word
- video sequences