Login / Signup
MAFormer: A Transformer Network with Multi-scale Attention Fusion for Visual Recognition.
Yunhao Wang
Huixin Sun
Xiaodi Wang
Bin Zhang
Chao Li
Ying Xin
Baochang Zhang
Errui Ding
Shumin Han
Published in:
CoRR (2022)
Keyphrases
</>
visual recognition
multiscale
image classification
visual recognition tasks
visual categorization
object recognition
view independent
latent topic models
image processing
distribution network
image fusion
natural images
scale space
neural network
visual attention
data fusion
image representation
three dimensional