Neural gradients are lognormally distributed: understanding sparse and quantized training.
Brian Chmiel, Liad Ben-Uri, Moran Shkolnik, Elad Hoffer, Ron Banner, Daniel Soudry
Published in: CoRR (2020)
Keyphrases
- distributed environment
- cooperative
- avoid overfitting
- distributed systems
- recurrent networks
- sparse data
- training set
- neural network
- lightweight
- training examples
- fault-tolerant
- network architecture
- communication cost
- elastic net
- feed-forward neural networks
- distributed data
- training phase
- training algorithm
- training process
- mobile agents
- test set
- bio-inspired
- high-dimensional
- training samples
- neural model
- peer-to-peer
- hidden Markov models
- artificial neural networks