Audio-visual Keyword Spotting for Mandarin Based on Discriminative Local Spatial-Temporal Descriptors.
Hong LiuTing FanPingping WuPublished in: ICPR (2014)
Keyphrases
- spatial temporal
- audio visual
- audio features
- multi modal
- speech recognition
- visual information
- temporal information
- spatio temporal
- spatial and temporal
- multimedia
- action recognition
- visual data
- video shots
- spatial information
- feature extraction
- broadcast news
- feature selection
- feature vectors
- pattern recognition
- image sequences
- web pages