Multimodal Transformer Networks with Latent Interaction for Audio-Visual Event Localization.

Yixuan He, Xing Xu, Xin Liu, Weihua Ou, Huimin Lu
Published in: ICME (2021)