Sign in

Grounding Everything: Emerging Localization Properties in Vision-Language Transformers.

Walid BousselhamFelix PetersenVittorio FerrariHilde Kuehne
Published in: CoRR (2023)
Keyphrases