A unified approach to multi-pose audio-visual ASR.
Patrick Lucey, Gerasimos Potamianos, Sridha Sridharan
Published in: INTERSPEECH (2007)
Keyphrases
- audio visual
- multi modal
- visual information
- visual data
- multi stream
- temporal context
- emotion recognition
- multimedia
- speech recognition
- person authentication
- pose estimation
- automatic speech recognition
- data sets
- hidden markov models
- d objects
- multimodal fusion
- dimensionality reduction
- domain knowledge
- feature selection
- databases