Marriage Between Coordinated and Uncoordinated Checkpointing for the Exascale Era.
Omer Subasi, Ferad Zyulkyarov, Osman S. Unsal, Jesús Labarta
Published in: HPCC/CSS/ICESS (2015)
Keyphrases
- high performance computing
- fault tolerance
- scientific computing
- distributed databases
- fault tolerant
- main memory databases
- multi agent
- distributed database systems
- failure recovery
- low overhead
- response time
- massively parallel
- distributed systems
- load balancing
- computing systems
- information explosion
- computing resources
- parallel computing
- computing environments
- big data
- cooperative