An audio-visual corpus for multimodal speech recognition in dutch language.
Jacek C. WojdelPascal WiggersLéon J. M. RothkrantzPublished in: INTERSPEECH (2002)
Keyphrases
- audio visual
- speech recognition
- audio visual speech recognition
- multi modal
- multi stream
- isolated word
- visual information
- hidden markov models
- language model
- speaker verification
- visual data
- multimodal fusion
- natural language
- multimedia
- speech signal
- automatic speech recognition
- pattern recognition
- emotion recognition
- digit recognition
- noisy environments
- speaker identification
- neural network
- speech recognition systems
- language processing
- keywords
- image data
- probabilistic model
- mobile devices