Sign in

Video and Audio are Images: A Cross-Modal Mixer for Original Data on Video-Audio Retrieval.

Zichen YuanQi ShenBingyi ZhengYuting LiuLinying JiangGuibing Guo
Published in: CoRR (2023)
Keyphrases