Acoustic and Visual Knowledge Distillation for Contrastive Audio-Visual Localization.
Ehsan Yaghoubi
André Peter Kelm
Timo Gerkmann
Simone Frintrop
Published in:
ICMI (2023)
Keyphrases
audio visual
visual information
visual data
multi modal
visual features
domain knowledge
multimedia
person authentication
image content
multi stream
temporal context
speaker verification
image collections
semantic information
data processing
knn
hidden markov models