Login / Signup

Audio-Visual Event Localization by Learning Spatial and Semantic Co-Attention.

Cheng XueXionghu ZhongMinjie CaiHao ChenWenwu Wang
Published in: IEEE Trans. Multim. (2023)
Keyphrases
  • audio visual
  • spatio temporal
  • multi modal
  • machine learning
  • high level
  • visual data
  • multimedia
  • knn
  • spatial information
  • human motion