Speech Detection By Facial Image For Multimodal Speech Recognition.
Kazumasa Murai, Ken'ichi Kumatani, Satoshi Nakamura
Published in: ICME (2001)
Keyphrases
- speech recognition
- facial images
- speech signal
- noisy environments
- speech synthesis
- hidden Markov models
- speech recognizer
- automatic speech recognition
- speech recognition technology
- speech processing
- pattern recognition
- language model
- speech recognition systems
- face recognition
- speaker identification
- voice activity detection
- speech recognizers
- detection method
- speaker independent
- facial expressions
- object detection
- detection algorithm
- speaker dependent
- recognition engine
- isolated word
- human faces
- speaker diarization
- keyword spotting
- multi modal
- noisy speech
- image database
- word error rate
- facial features
- text to speech
- machine learning
- image data
- acoustic models
- cepstral coefficients
- speech retrieval
- face detection
- audio visual