Audio-visual event classification via spatial-temporal-audio words.
Yu CaoSung BaangShih-Hsi LiuMing LiSanqing HuPublished in: ICPR (2008)
Keyphrases
- audio visual
- spatial temporal
- multi modal
- visual information
- audio visual speech recognition
- multimedia
- multi stream
- visual data
- spatio temporal
- machine learning
- feature space
- audio features
- feature selection
- temporal information
- spatial and temporal
- text classification
- action recognition
- image classification
- visual content
- text mining
- low level