Login / Signup

SPARTAN: Sparse Hierarchical Memory for Parameter-Efficient Transformers.

Ameet DeshpandeMd. Arafat SultanAnthony FerrittoAshwin KalyanKarthik NarasimhanAvirup Sil
Published in: CoRR (2022)
Keyphrases