Login / Signup
Induction Network: Audio-Visual Modality Gap-Bridging for Self-Supervised Sound Source Localization.
Tianyu Liu
Peng Zhang
Wei Huang
Yufei Zha
Tao You
Yanning Zhang
Published in:
ACM Multimedia (2023)
Keyphrases
</>
audio visual
sound source
source localization
multi modal
wireless sensor networks
visual information
multi stream
multimedia
visual data
audio features
audio visual speech recognition
speech signal
probabilistic model
information retrieval
machine learning
focus of attention
text classification
image processing