Robust front-end for audio, visual and audio-visual speech classification.
Lucas D. Terissi, Gonzalo D. Sad, Juan Carlos Gómez
Published in: Int. J. Speech Technol. (2018)
Keyphrases
- audio visual
- audio visual speech recognition
- person authentication
- visual speech
- multi modal
- multi stream
- visual information
- noisy environments
- hidden markov models
- emotion recognition
- visual data
- classification accuracy
- pattern recognition
- speaker identification
- audio features
- multimedia
- feature extraction
- speaker verification
- image classification
- e learning
- text classification
- high level
- image retrieval
- speech recognition