Login / Signup

GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection.

Haozhan ShenTiancheng ZhaoMingwei ZhuJianwei Yin
Published in: CoRR (2023)
Keyphrases