DOA-Aware Audio-Visual Self-Supervised Learning for Sound Event Localization and Detection.

Yoto Fujita, Yoshiaki Bando, Keisuke Imoto, Masaki Onishi, Kazuyoshi Yoshii
Published in: APSIPA ASC (2023)
Keyphrases
  • audio visual
  • human computer interaction
  • spatio temporal
  • event detection
  • machine learning
  • image sequences
  • data analysis
  • multi modal
  • visual information