Paramixer: Parameterizing Mixing Links in Sparse Factors Works Better than Dot-Product Self-Attention.

Published in: CVPR (2022)

Keyphrases