Login / Signup
MSViT: Dynamic Mixed-scale Tokenization for Vision Transformers.
Jakob Drachmann Havtorn
Amélie Royer
Tijmen Blankevoort
Babak Ehteshami Bejnordi
Published in:
ICCV (Workshops) (2023)
Keyphrases
</>
computer vision
vision system
real time
neural network
machine learning
information systems
scale space
document collections
dynamically changing