Aligning Sight and Sound: Advanced Sound Source Localization Through Audio-Visual Alignment.

Arda Senocak, Hyeonggon Ryu, Junsik Kim, Tae-Hyun Oh, Hanspeter Pfister, Joon Son Chung
Published in: CoRR (2024)
Keyphrases
  • sound source
  • audio visual
  • source localization
  • multi modal
  • visual information
  • visual data
  • audio features
  • multimedia
  • domain knowledge
  • multi stream
  • human computer interaction
  • action recognition
  • focus of attention