Max Pooling with Vision Transformers Reconciles Class and Shape in Weakly Supervised Semantic Segmentation.

Published in: ECCV (30) (2022)

Keyphrases