Scalable Distributed Consensus to Support MPI Fault Tolerance.
Darius BuntinasPublished in: EuroMPI (2011)
Keyphrases
- fault tolerance
- fault tolerant
- scalable distributed
- group communication
- high performance computing
- load balancing
- high availability
- distributed computing
- distributed systems
- peer to peer
- failure recovery
- grid computing
- database replication
- mobile agents
- data replication
- replicated databases
- single point of failure
- message passing
- parallel algorithm
- long running
- response time
- fault management