Login / Signup

GroundVLP: Harnessing Zero-Shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection.

Haozhan ShenTiancheng ZhaoMingwei ZhuJianwei Yin
Published in: AAAI (2024)
Keyphrases