Login / Signup
Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of On-Screen Sounds.
Efthymios Tzinis
Scott Wisdom
Aren Jansen
Shawn Hershey
Tal Remez
Dan Ellis
John R. Hershey
Published in:
ICLR (2021)
Keyphrases
</>
audio visual
sound source
multi modal
visual information
visual data
emotion recognition
temporal context
multimedia
multi stream
audio features
speaker verification
person authentication
databases
machine learning
domain knowledge
multimodal fusion