CRAFT: A Library for Easier Application-Level Checkpoint/Restart and Automatic Fault Tolerance.
Faisal ShahzadJonas ThiesMoritz KreutzerThomas ZeiserGeorg HagerGerhard WelleinPublished in: IEEE Trans. Parallel Distributed Syst. (2019)
Keyphrases
- fault tolerance
- application level
- fault tolerant
- peer to peer
- distributed computing
- load balancing
- distributed systems
- overlay network
- response time
- high availability
- operating system
- network management
- fault management
- quality of service
- replicated databases
- group communication
- mobile agents
- virtual machine
- database replication
- bottle neck
- error detection
- failure recovery
- single point of failure
- database
- anomaly detection
- data collection
- databases
- data sets