1-bit Adam: Communication Efficient Large-Scale Training with Adam's Convergence Speed.
Hanlin Tang, Shaoduo Gan, Ammar Ahmad Awan, Samyam Rajbhandari, Conglong Li, Xiangru Lian, Ji Liu, Ce Zhang, Yuxiong He
Published in: CoRR (2021)
Keyphrases
- convergence speed
- differential evolution
- convergence rate
- firefly algorithm
- global search
- particle swarm optimization
- learning rate
- particle swarm optimization algorithm
- population diversity
- training speed
- pso algorithm
- step size
- search capabilities
- global convergence
- training process
- ant colony optimization algorithm
- faster convergence
- steady state error
- neural network
- training algorithm
- evolutionary algorithm
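The record above contains only the title and keyphrases, not a description of the method. As a rough, hedged illustration of the general idea the title alludes to (error-compensated 1-bit compression of an optimizer update to cut communication cost), the sketch below shows sign-based compression with an error-feedback buffer. All names are hypothetical and the code is not taken from the paper or its implementation.

```python
import numpy as np

def one_bit_compress(update, error_buffer):
    """Illustrative sketch (not the paper's code): compress a tensor to
    its sign, scaled by its mean magnitude, and carry the compression
    error forward so it is re-injected at the next step (error feedback)."""
    corrected = update + error_buffer           # add residual from the previous step
    scale = np.abs(corrected).mean()            # one scalar per tensor
    compressed = scale * np.sign(corrected)     # 1-bit payload plus one float
    error_buffer = corrected - compressed       # residual to reuse next step
    return compressed, error_buffer

# Toy usage: compress a momentum-like tensor over a few iterations.
rng = np.random.default_rng(0)
err = np.zeros(8)
for step in range(3):
    momentum = rng.normal(size=8)
    sent, err = one_bit_compress(momentum, err)
    print(step, sent)
```

The error buffer is what keeps repeated 1-bit rounding from biasing training: whatever is lost at one step is added back before the next compression.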