SoftVAD in iVector-Based Acoustic Scene Classification for Robustness to Foreground Speech.
Siyuan SongBrecht DesplanquesKris DemuynckNilesh MadhuPublished in: EUSIPCO (2022)
Keyphrases
- scene classification
- object recognition
- biologically inspired
- image classification
- natural scenes
- indoor outdoor
- speech sounds
- scene recognition
- bag of features
- visual words
- image representation
- speech recognition
- speech signal
- moving objects
- scene representation
- natural images
- image data
- pairwise
- image sequences
- bag of words
- background subtraction
- higher order
- image retrieval
- feature space
- multiscale
- similarity measure