Login / Signup

MSViT: Dynamic Mixed-scale Tokenization for Vision Transformers.

Jakob Drachmann HavtornAmélie RoyerTijmen BlankevoortBabak Ehteshami Bejnordi
Published in: ICCV (Workshops) (2023)
Keyphrases
  • computer vision
  • vision system
  • real time
  • neural network
  • machine learning
  • information systems
  • scale space
  • document collections
  • dynamically changing