End-to-End Multimodal Speech Recognition.
Shruti PalaskarRamon SanabriaFlorian MetzePublished in: CoRR (2018)
Keyphrases
- end to end
- speech recognition
- hidden markov models
- speech synthesis
- pattern recognition
- automatic speech recognition
- speech processing
- language model
- speech signal
- congestion control
- speech recognizer
- multi modal
- speech recognition technology
- noisy environments
- speaker identification
- speech recognition systems
- speech retrieval
- computer vision
- speaker dependent
- maximum likelihood
- probabilistic model
- multimedia
- machine learning
- speech recognizers