Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship Detection.

Tim Salzmann, Markus Ryll, Alex Bewley, Matthias Minderer
Published in: CoRR (2024)
Keyphrases