Extending the scope of the Checkpoint-on-Failure protocol for forward recovery in standard MPI.
Wesley Bland
Peng Du
Aurélien Bouteiller
Thomas Hérault
George Bosilca
Jack J. Dongarra
Published in:
Concurr. Comput. Pract. Exp. (2013)
Keyphrases
failure recovery
fault tolerance
failure detection
neural network
lightweight
fault tolerant
parallel algorithm
high performance computing
coloured petri nets
data sets
general purpose
graphical models
bidirectional
image recovery