SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning.

Kevin Lin, Linjie Li, Chung-Ching Lin, Faisal Ahmed, Zhe Gan, Zicheng Liu, Yumao Lu, Lijuan Wang
Published in: CVPR (2022)