Bytes are All You Need: End-to-End Multilingual Speech Recognition and Synthesis with Bytes.
Bo Li, Yu Zhang, Tara N. Sainath, Yonghui Wu, William Chan
Published in: CoRR (2018)
Keyphrases
- end to end
- speech recognition
- hidden markov models
- language model
- speech processing
- pattern recognition
- speech signal
- speech recognizer
- automatic speech recognition
- speech recognition technology
- congestion control
- speech recognition systems
- speech synthesis
- speaker identification
- digital libraries
- noisy environments
- speaker independent
- speaker dependent
- image processing
- speech retrieval
- speech recognizers
- isolated word
- audio visual speech recognition
- text localization and recognition