End-to-end Audiovisual Speech Recognition.
Stavros Petridis, Themos Stafylakis, Pingchuan Ma, Feipeng Cai, Georgios Tzimiropoulos, Maja Pantic
Published in: CoRR (2018)
Keyphrases
- end to end
- speech recognition
- language model
- automatic speech recognition
- speech recognizer
- speech processing
- hidden markov models
- pattern recognition
- speech signal
- speech recognition technology
- congestion control
- speech synthesis
- visual information
- video retrieval
- multimedia content
- speech recognition systems
- audio visual
- speech retrieval
- speaker identification
- noisy environments
- speech recognizers
- isolated word
- neural network
- speaker independent
- video sequences
- real world