LF-ViT: Reducing Spatial Redundancy in Vision Transformer for Efficient Image Recognition.

Youbing HuYun ChengAnqi LuZhiqiang CaoDawei WeiJie LiuZhijun Li
Published in: AAAI (2024)
Keyphrases
  • image recognition
  • face recognition
  • computer vision
  • image processing
  • spatial information
  • pattern recognition
  • spatio temporal
  • vision system
  • image classification
  • neural network
  • d objects