Login / Signup
CSS-Net: A Consistent Segment Selection Network for Audio-Visual Event Localization.
Fan Feng
Yue Ming
Nannan Hu
Hui Yu
Yuanan Liu
Published in:
IEEE Trans. Multim. (2024)
Keyphrases
</>
audio visual
multi modal
visual information
temporal context
multi stream
video summarization
multimedia
visual data
wireless sensor networks
audio visual speech recognition
databases
video sequences
co occurrence