General-purpose tagging of Freesound audio with AudioSet labels: task description, dataset, and baseline.
Eduardo Fonseca, Manoj Plakal, Frederic Font, Daniel P. W. Ellis, Xavier Favory, Jordi Pons, Xavier Serra
Published in: DCASE (2018)
Keyphrases
- general purpose
- manually labeled
- special purpose
- benchmark datasets
- weakly labeled
- multimedia
- high level
- programming language
- class labels
- application specific
- visual information
- database
- domain specific
- signal processing
- tightly coupled
- training set
- visual data
- pairwise
- social tagging
- audio video
- training data
- images with ground truth