Login / Signup
Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet.
Li Yuan
Yunpeng Chen
Tao Wang
Weihao Yu
Yujun Shi
Zihang Jiang
Francis E. H. Tay
Jiashi Feng
Shuicheng Yan
Published in:
ICCV (2021)
Keyphrases
</>
computer vision
linear svm
vision system
computer software
training set
real time
small number
training phase
logistic regression
feature extraction
genetic algorithm
multiscale
text classification
training examples
image collections
image processing
neural network