• search
    search
  • reviewers
    reviewers
  • feeds
    feeds
  • assignments
    assignments
  • settings
  • logout

GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection.

Haozhan ShenTiancheng ZhaoMingwei ZhuJianwei Yin
Published in: CoRR (2023)
Keyphrases