Simplifying fault-tolerance: providing the abstraction of crash failures.
Rida A. Bazzi, Gil Neiger
Published in: J. ACM (2001)
Keyphrases
- fault tolerance
- fault tolerant
- failure recovery
- node failures
- distributed systems
- load balancing
- distributed computing
- component failures
- high availability
- response time
- replicated databases
- group communication
- single point of failure
- mobile agents
- database replication
- high scalability
- peer to peer
- error detection
- data replication
- fault management
- data sets