Reliability-Aware Approach: An Incremental Checkpoint/Restart Model in HPC Environments.
Nichamon NaksinehaboonYudan LiuChokchai LeangsuksunRaja NassarMihaela PaunStephen L. ScottPublished in: CCGRID (2008)
Keyphrases
- real world
- management system
- high level
- mathematical model
- genetic algorithm
- random walk
- computational model
- formal model
- probabilistic model
- conceptual model
- dynamic environments
- reliability analysis
- real time
- incremental learning
- fault tolerance
- statistical model
- parameter estimation
- input data
- distributed systems
- data model
- evolutionary algorithm
- multiscale
- case study