Simple Open-Vocabulary Object Detection with Vision Transformers.
Matthias Minderer, Alexey A. Gritsenko, Austin Stone, Maxim Neumann, Dirk Weissenborn, Alexey Dosovitskiy, Aravindh Mahendran, Anurag Arnab, Mostafa Dehghani, Zhuoran Shen, Xiao Wang, Xiaohua Zhai, Thomas Kipf, Neil Houlsby
Published in: CoRR (2022)