Sign in

Collaborative Static and Dynamic Vision-Language Streams for Spatio-Temporal Video Grounding.

Zihang LinChaolei TanJian-Fang HuZhi JinTiancai YeWei-Shi Zheng
Published in: CVPR (2023)
Keyphrases