Automatic and coordinated job recovery for high performance computing.
Wei Tang, Zhiling Lan, Narayan Desai, Daniel Buettner
Published in: MTAGS@SC (2010)
Keyphrases
- high performance computing
- scientific computing
- computational science
- massively parallel
- parallel computing
- computing systems
- national laboratory
- grid computing
- computing resources
- energy efficiency
- molecular dynamics
- computing environments
- high performance data mining
- fault tolerance
- database
- computing infrastructure
- heterogeneous computing
- multi agent
- data center
- management system
- special case
- database systems