Login / Signup

Layer-wise enhanced transformer with multi-modal fusion for image caption.

Jingdan LiYi WangDexin Zhao
Published in: Multim. Syst. (2023)
Keyphrases