Login / Signup

Robust Audio-Visual Contrastive Learning for Proposal-Based Self-Supervised Sound Source Localization in Videos.

Hanyu XuanZhiliang WuJian YangBo JiangLei LuoXavier Alameda-PinedaYan Yan
Published in: IEEE Trans. Pattern Anal. Mach. Intell. (2024)
Keyphrases
  • audio visual
  • source localization
  • multi modal
  • data sets
  • sound source
  • machine learning
  • information retrieval
  • pattern recognition
  • visual features
  • dynamic environments