Robust Audio-Visual Contrastive Learning for Proposal-Based Self-Supervised Sound Source Localization in Videos.

Published in: IEEE Trans. Pattern Anal. Mach. Intell. (2024)

Keyphrases