Binaural Signal Representations for Joint Sound Event Detection and Acoustic Scene Classification.
Daniel Aleksander Krause, Annamaria Mesaros
Published in: EUSIPCO (2022)
Keyphrases
- event detection
- sound source
- scene classification
- object recognition
- mid-level
- natural scenes
- image classification
- biologically inspired
- video event detection
- audio-visual
- visual words
- video surveillance
- image representation
- activity recognition
- speech signal
- bag of features
- bag of visual words
- natural images
- multi-modal
- image features
- machine learning
- bag of words
- visual features
- information retrieval