VidTIMIT audio visual phoneme recognition using AAM visual features and human auditory motivated acoustic wavelet features.
Astik BiswasPrakash Kumar SahuAnirban BhowmickMahesh ChandraPublished in: ReTIS (2015)
Keyphrases
- audio visual
- visual information
- visual features
- wavelet features
- visual data
- sound source
- acoustic features
- visual content
- image classification
- audio features
- image retrieval
- wavelet transform
- feature extraction
- low level
- image collections
- speaker verification
- key frames
- texture features
- speech recognition
- multi modal
- keywords
- machine learning
- visual speech
- image representation
- multimedia
- computer vision