AVATAR: Unconstrained Audiovisual Speech Recognition.
Valentin Gabeur, Paul Hongsuck Seo, Arsha Nagrani, Chen Sun, Karteek Alahari, Cordelia Schmid
Published in: CoRR (2022)
Keyphrases
- speech recognition
- hidden markov models
- speech recognizer
- speech synthesis
- language model
- speech signal
- automatic speech recognition
- speech processing
- speech understanding
- pattern recognition
- speaker identification
- handwriting recognition
- noisy environments
- video retrieval
- multimedia content
- visual information
- speaker independent
- speech recognition technology
- audio visual
- neural network
- keyword spotting
- speech retrieval
- speech recognizers
- speech recognition systems
- low level
- speaker dependent