Toward a perceptive pretraining framework for Audio-Visual Video Parsing.
Jianning Wu
Zhuqing Jiang
Qingchao Chen
Shiping Wen
Aidong Men
Haiying Wang
Published in:
Inf. Sci. (2022)
Keyphrases
audio visual
temporal context
multi modal
sports video
visual data
multimedia
video sequences
video data
visual information
video summarization
feature selection
image features
video streams
audio features
multi stream
meeting room