Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification.
Chao ZhangBo LiTara N. SainathTrevor StrohmanSepand MavandadiShuo-Yiin ChangParisa HaghaniPublished in: CoRR (2022)
Keyphrases
- end to end
- speech recognition
- language identification
- speaker identification
- scalable video
- indian languages
- speech signal
- hidden markov models
- language model
- automatic speech recognition
- english text
- pattern recognition
- gaussian mixture model
- document images
- noisy environments
- speech recognition systems
- cross lingual
- language independent
- video streaming
- cross language
- machine learning
- feature vectors
- feature extraction
- image processing
- information retrieval