HARNESS and fault tolerant MPI.
Graham E. FaggAntonin BukovskyJack J. DongarraPublished in: Parallel Comput. (2001)
Keyphrases
- fault tolerant
- fault tolerance
- distributed systems
- high performance computing
- message passing
- parallel algorithm
- parallel implementation
- message passing interface
- load balancing
- shared memory
- state machine
- high availability
- parallelization strategy
- distributed memory
- mobile agent system
- massively parallel
- safety critical
- parallel computing
- data replication
- software development
- database