Login / Signup

Vision-Text Cross-Modal Fusion for Accurate Video Captioning.

Kaouther OuennicheRuxandra TapuTitus B. Zaharia
Published in: IEEE Access (2023)
Keyphrases