Replication-Based Fault Tolerance for MPI Applications.
John Paul WaltersVipin ChaudharyPublished in: IEEE Trans. Parallel Distributed Syst. (2009)
Keyphrases
- fault tolerance
- high performance computing
- fault tolerant
- distributed systems
- group communication
- database replication
- load balancing
- message passing
- replicated databases
- response time
- high availability
- distributed computing
- parallel implementation
- peer to peer
- parallel algorithm
- shared memory
- data replication
- parallel computing
- fault management
- single point of failure
- data sets
- mobile agents
- database systems
- node failures
- database
- replica control