Token Labeling: Training a 85.4% Top-1 Accuracy Vision Transformer with 56M Parameters on ImageNet.

Published in: CoRR (2021)

Keyphrases