Audio-Visual Speech Recognition Using MPEG-4 Compliant Visual Features.
Petar S. Aleksic, Jay J. Williams, Zhilin Wu, Aggelos K. Katsaggelos
Published in: EURASIP J. Adv. Signal Process. (2002)
Keyphrases
- audio visual
- visual features
- visual information
- visual descriptors
- visual data
- visual content
- multimedia
- image classification
- low level
- image retrieval
- audio features
- emotion recognition
- multi modal
- image annotation
- keywords
- low level features
- video sequences
- image collections
- key frames
- multi stream
- video retrieval
- acoustic features