Audio-visual interaction in sparse representation features for noise robust audio-visual speech recognition.
Peng Shen, Satoshi Tamura, Satoru Hayamizu
Published in: AVSP (2013)
Keyphrases
- audio-visual speech recognition
- audio-visual
- sparse representation
- multi-stream
- noisy environments
- multi-modal
- audio features
- face recognition
- visual speech
- feature vectors
- high dimensional data
- visual information
- image classification
- visual data
- emotion recognition
- feature set
- noise reduction
- feature extraction
- signal processing
- dimensionality reduction
- co-occurrence
- audio signal
- high level
- machine learning
- spatial information
- feature space
- multiscale
- computer vision