Transparent Fault Tolerance for Stateful Applications in Kubernetes with Checkpoint/Restore.
Henri SchmidtZeineb RejibaRaphael EidenbenzKlaus-Tycho FörsterPublished in: SRDS (2023)
Keyphrases
- fault tolerance
- fault tolerant
- distributed systems
- load balancing
- distributed computing
- high availability
- replicated databases
- response time
- group communication
- fault management
- peer to peer
- mobile agents
- single point of failure
- data replication
- distributed query processing
- high performance computing
- error detection
- database replication
- distributed databases
- cooperative
- databases
- data sets