Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers.
Dahun KimAnelia AngelovaWeicheng KuoPublished in: CVPR (2023)
Keyphrases
- object detection
- computer vision
- object classification
- object recognition
- scene understanding
- vision system
- object categories
- real time
- image processing
- object class
- input image
- face detection
- machine learning
- region of interest
- pedestrian detection
- multi class
- keywords
- background subtraction
- image regions
- metadata
- grey level
- human vision
- scene recognition