Middleware to Manage Fault Tolerance Using Semi-Coordinated Checkpoints.
Alvaro WongElisa HeymannDolores RexachsEmilio LuquePublished in: IEEE Trans. Parallel Distributed Syst. (2021)
Keyphrases
- fault tolerance
- distributed systems
- fault tolerant
- mobile agents
- load balancing
- distributed computing
- response time
- high availability
- multi agent
- group communication
- peer to peer
- database replication
- replicated databases
- high scalability
- cooperative
- fault management
- single point of failure
- component failures
- data replication
- high performance computing
- distributed environment
- failure recovery
- error detection
- knowledge based systems