Login / Signup
Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Temporal Synchronicity.
Pritam Sarkar
Ali Etemad
Published in:
CoRR (2021)
Keyphrases
</>
audio visual
cross modal
multi modal
visual data
perceptual information
visual recognition
multimedia retrieval
information retrieval
image database
text classification
semantic information
visual information
activity recognition