To checkpoint or not to checkpoint: Understanding energy-performance-I/O tradeoffs in HPC checkpointing.
Nosayba El-SayedBianca SchroederPublished in: CLUSTER (2014)
Keyphrases
- fault tolerance
- fault tolerant
- high performance computing
- load balancing
- response time
- distributed systems
- distributed computing
- peer to peer
- failure recovery
- mobile agents
- energy consumption
- input output
- energy minimization
- scientific computing
- high speed
- energy efficiency
- design decisions
- main memory
- energy efficient
- databases