Login / Signup
Audio-Visual Event Localization by Learning Spatial and Semantic Co-Attention.
Cheng Xue
Xionghu Zhong
Minjie Cai
Hao Chen
Wenwu Wang
Published in:
IEEE Trans. Multim. (2023)
Keyphrases
</>
audio visual
spatio temporal
multi modal
machine learning
high level
visual data
multimedia
knn
spatial information
human motion