DropCompute: simple and more robust distributed synchronous training via compute variance reduction.

Published in: CoRR (2023)

Keyphrases