Visual information and redundancy conveyed by internal articulator dynamics in synthetic audiovisual speech.
Katja Grauwinkel, Britta Dewitt, Sascha Fagel
Published in: INTERSPEECH (2007)
Keyphrases
- visual information
- audio visual
- audio visual speech recognition
- multi stream
- visual features
- visual data
- emotion recognition
- low level
- visual content
- visual cues
- speech recognition
- eye movements
- textual information
- low level features
- semantic information
- visual descriptors
- databases
- visual information retrieval
- content based image retrieval systems
- image collections
- visual concepts
- high dimensional
- object recognition
- high level
- feature selection
- artificial intelligence