Training Vision Transformers with Only 2040 Images.
Yun-Hao CaoHao YuJianxin WuPublished in: CoRR (2022)
Keyphrases
- image data
- ground truth
- input image
- three dimensional
- image analysis
- image features
- image database
- test images
- image retrieval
- image classification
- object recognition
- multiple images
- image annotation
- rigid body
- image set
- image collections
- training and testing data
- image understanding
- real time
- image regions
- edge detection
- lighting conditions
- keypoints
- vision system
- small number
- object detectors
- image registration
- original images
- computer vision