Spatio-Temporal Attention Pooling for Audio Scene Classification.
Huy PhanOliver Y. ChénLam Dang PhamPhilipp KochMaarten De VosIan McLoughlinAlfred MertinsPublished in: CoRR (2019)
Keyphrases
- scene classification
- spatio temporal
- object recognition
- natural scenes
- spatial pyramid matching
- image classification
- biologically inspired
- indoor outdoor
- visual words
- image representation
- scene recognition
- bag of visual words
- visual attention
- scene representation
- multimedia
- spatial and temporal
- spatial layout
- moving objects
- visual information
- natural images
- image sequences
- face recognition
- computer vision
- image processing
- image segmentation
- bag of features
- object detection
- pairwise
- bag of words
- machine learning
- higher order