Weighting schemes for audio-visual fusion in speech recognition.
Hervé Glotin, D. Vergyri, Chalapathy Neti, Gerasimos Potamianos, Juergen Luettin
Published in: ICASSP (2001)
Keyphrases
- audio visual
- speech recognition
- audio visual speech recognition
- multi modal
- multi stream
- visual information
- hidden markov models
- language model
- speech signal
- tf idf
- pattern recognition
- visual data
- noisy environments
- multimedia
- audio features
- nearest neighbor
- image classification
- contextual information
- low level
- speaker identification