Monetary cost optimizations for MPI-based HPC applications on Amazon clouds: checkpoints and replicated execution.
Yifan GongBingsheng HeAmelie Chi ZhouPublished in: SC (2015)
Keyphrases
- high performance computing
- message passing interface
- fault tolerance
- fault tolerant
- scientific computing
- general purpose
- testing process
- resource consumption
- massively parallel
- parallelization strategy
- cloud computing
- parallel algorithm
- parallel computing
- distributed computing
- expected cost
- scheduling problem
- total cost
- cost sensitive
- fine grained
- sensor networks