STVGFormer: Spatio-Temporal Video Grounding with Static-Dynamic Cross-Modal Understanding.

Zihang Lin, Chaolei Tan, Jian-Fang Hu, Zhi Jin, Tiancai Ye, Wei-Shi Zheng
Published in: CoRR (2022)
Keyphrases