Login / Signup
Self-supervised Learning of Audio Representations from Audio-Visual Data using Spatial Alignment.
Shanshan Wang
Archontis Politis
Annamaria Mesaros
Tuomas Virtanen
Published in:
CoRR (2022)
Keyphrases
</>
visual data
visual information
audio visual
visual features
image data
contextual information
multimedia data
multimedia
visual content
machine learning
object recognition
data management
video data
high dimensional data
human motion
multimodal information