Achieving Target MTTF by Duplicating Reliability-Critical Components in High Performance Computing Systems.
Nithin NakkaAlok N. ChoudharyGary GriderJohn BentJames NunezSatsangat KhalsaPublished in: IPDPS Workshops (2011)
Keyphrases
- computing systems
- highly parallel
- computing technologies
- computer systems
- scientific computing
- autonomic computing systems
- high performance computing
- parallel computing
- autonomic computing
- graphics processing units
- processing units
- academia and industry
- machine learning
- database
- computing platform
- hardware platforms
- software developers
- parallel processing
- ubiquitous computing environments
- high end
- parallel architectures
- software development
- user interface
- mobile devices
- data mining