Login / Signup

Does compressing activations help model parallel training?

Song BianDacheng LiHongyi WangEric P. XingShivaram Venkataraman
Published in: CoRR (2023)
Keyphrases