Self-supervised object detection from audio-visual correspondence.
Triantafyllos AfourasYuki Markus AsanoFrancois FaganAndrea VedaldiFlorian MetzePublished in: CoRR (2021)
Keyphrases
- visual correspondence
- object detection
- multimedia
- computer vision
- object class
- mutual information
- visual information
- scene understanding
- pedestrian detection
- object categories
- background subtraction
- object classification
- face detection
- multi class
- audio visual
- object recognition
- deformable part models
- object segmentation
- image parsing
- audio video
- scene recognition
- human detection
- signal processing
- digital video
- audio signals
- visual data
- object detectors
- pattern recognition
- image processing