Login / Signup

Dual-Scale Alignment-Based Transformer on Linguistic Skeleton Tags for Non-Autoregressive Video Captioning.

Xian ZhongYi ZhangShuqin ChenZhixin SunHuantao ZhengKui Jiang
Published in: ICME (2022)
Keyphrases