Perception of congruent and incongruent audiovisual speech stimuli.
Jintao Jiang, Lynne E. Bernstein, Edward T. Auer Jr.
Published in: AVSP (2005)
Keyphrases
- audio visual
- emotion recognition
- speech recognition
- multi modal
- emotional state
- visual information
- speech synthesis
- multi stream
- visual stimuli
- human perception
- speech signal
- spoken language
- multimedia
- lexical features
- text to speech
- brain activity
- dialogue system
- visual data
- video clips
- video retrieval
- multimedia content
- neural network
- noisy environments
- visual perception
- language acquisition
- automatic speech recognition
- broadcast news
- speaker identification
- spoken dialogue systems
- human computer interaction
- external world
- endpoint detection