CNN-LTE: A class of 1-X pooling convolutional neural networks on label tree embeddings for audio scene classification.
Huy PhanPhilipp KochLars HertelMarco MaaßRadoslaw MazurAlfred MertinsPublished in: ICASSP (2017)
Keyphrases
- scene classification
- convolutional neural networks
- multi instance multi label learning
- natural scenes
- object recognition
- indoor outdoor
- image classification
- image representation
- biologically inspired
- scene recognition
- spatial pyramid matching
- visual information
- visual words
- bag of features
- bag of visual words
- multimedia
- text categorization
- object detection