Robust end-to-end deep audiovisual speech recognition.
Ramon SanabriaFlorian MetzeFernando De la TorrePublished in: CoRR (2016)
Keyphrases
- end to end
- speech recognition
- noisy environments
- hidden markov models
- speech processing
- language model
- automatic speech recognition
- speech recognizer
- pattern recognition
- speech synthesis
- congestion control
- text localization and recognition
- speaker identification
- speech signal
- speech recognition systems
- speech recognition technology
- isolated word
- neural network
- visual information
- computer vision
- speaker independent
- speaker dependent
- speech retrieval
- audio visual speech recognition
- machine learning