Login / Signup

Global-Shared Text Representation Based Multi-Stage Fusion Transformer Network for Multi-Modal Dense Video Captioning.

Yulai XieJingjing NiuYang ZhangFang Ren
Published in: IEEE Trans. Multim. (2024)
Keyphrases