Resilience-Aware Resource Management for Exascale Computing Systems.
Daniel DauweSudeep PasrichaAnthony A. MaciejewskiHoward Jay SiegelPublished in: IEEE Trans. Sustain. Comput. (2018)
Keyphrases
- computing systems
- resource management
- high performance computing
- scientific computing
- computing resources
- management system
- computer systems
- resource allocation
- grid computing
- parallel computing
- computing technologies
- autonomic computing
- intelligent agents
- quality of service
- resource utilization
- fault tolerance
- machine learning
- massively parallel
- software developers
- parallel algorithm
- context aware
- highly parallel
- open source
- databases