Fault tolerant high performance computing by a coding approach.
Zizhong Chen, Graham E. Fagg, Edgar Gabriel, Julien Langou, Thara Angskun, George Bosilca, Jack J. Dongarra
Published in: PPOPP (2005)
Keyphrases
- high performance computing
- fault tolerant
- fault tolerance
- scientific computing
- distributed systems
- computational science
- load balancing
- distributed computing
- grid computing
- massively parallel
- computing systems
- error detection
- computing resources
- computer architecture
- databases
- artificial intelligence
- cloud computing
- computing environments
- parallel computing
- knowledge based systems
- multi agent