Login / Signup

ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions.

Chunlong XiaXinliang WangFeng LvXin HaoYifeng Shi
Published in: CoRR (2024)
Keyphrases