Using the focus of visual attention to improve spontaneous speech recognition.
Neil CookeMartin J. RussellPublished in: INTERSPEECH (2005)
Keyphrases
- speech recognition
- visual attention
- focus of attention
- hidden markov models
- saliency map
- speech synthesis
- eye tracking
- eye movements
- speech signal
- automatic speech recognition
- pattern recognition
- noisy environments
- speech recognizer
- language model
- speech processing
- speech recognition systems
- speech recognition technology
- visual attention model
- salient regions
- vision system
- speaker identification
- machine learning
- generative model
- feature vectors
- face recognition
- image processing
- real time
- object based visual attention