Sign in

Multi-scale network with shared cross-attention for audio-visual correlation learning.

Jiwei ZhangYi YuSuhua TangWei LiJianming Wu
Published in: Neural Comput. Appl. (2023)
Keyphrases
  • audio visual
  • multiscale
  • multi modal
  • information retrieval
  • human body