On the Viability of Checkpoint Compression for Extreme Scale Fault Tolerance.
Dewan Ibtesham, Dorian C. Arnold, Kurt B. Ferreira, Patrick G. Bridges
Published in: Euro-Par Workshops (2) (2011)
Keyphrases
- fault tolerance
- fault tolerant
- distributed computing
- high availability
- distributed systems
- load balancing
- response time
- mobile agents
- replicated databases
- image compression
- peer to peer
- group communication
- fault management
- failure recovery
- data replication
- database replication
- single point of failure
- sensor data
- digital libraries