Login / Signup
Evaluating Multi-Level Checkpointing for Distributed Deep Neural Network Training.
Quentin Anthony
Donglai Dai
Published in:
SC (Workshops) (2021)
Keyphrases
</>
neural network training
distributed systems
neural network
distributed database systems
distributed environment
distributed databases
training algorithm
genetic algorithm
peer to peer
optimization method
low overhead
learning algorithm
decision trees
learning process
linear combination
failure recovery