Scalable Fault Tolerant MPI: Extending the Recovery Algorithm.
Graham E. Fagg
Thara Angskun
George Bosilca
Jelena Pjesivac-Grbovic
Jack J. Dongarra
Published in:
PVM/MPI (2005)
Keyphrases
fault tolerant
recovery algorithm
fault tolerance
distributed systems
message passing
parallel implementation
high performance computing
load balancing
state machine
parallel algorithm
safety critical
shared memory
feature extraction
digital libraries
parallel computing
massively parallel