Enhanced Sound Event Localization and Detection in Real 360-degree audio-visual soundscapes.
Adrian S. Roman
Baladithya Balamurugan
Rithik Pothuganti
Published in:
CoRR (2024)
Keyphrases
audio visual
multi modal
event detection
sound source
video scene
multi stream
visual information
temporal context
visual data
multimedia
video summarization
feature extraction
person authentication
audio visual speech recognition
image data
sports video