MSDF-SGD: Most-Significant Digit-First Stochastic Gradient Descent for Arbitrary-Precision Training.

Published in: FPL (2023)

Keyphrases