UniAV: Unified Audio-Visual Perception for Multi-Task Video Localization.
Tiantian Geng
Teng Wang
Yanfu Zhang
Jinming Duan
Weili Guan
Feng Zheng
Published in:
CoRR (2024)
Keyphrases
visual perception
multi task
multimedia
multi task learning
learning tasks
visual attention
multiple tasks
video data
learning problems
video sequences
multi class
gaussian processes
feature selection
sparse learning
transfer learning
real time
video frames
eye tracking
multiscale
information gain
high level