Login / Signup
Robust Audio-Visual Contrastive Learning for Proposal-Based Self-Supervised Sound Source Localization in Videos.
Hanyu Xuan
Zhiliang Wu
Jian Yang
Bo Jiang
Lei Luo
Xavier Alameda-Pineda
Yan Yan
Published in:
IEEE Trans. Pattern Anal. Mach. Intell. (2024)
Keyphrases
</>
audio visual
source localization
multi modal
data sets
sound source
machine learning
information retrieval
pattern recognition
visual features
dynamic environments