High Performance Checksum Computation for Fault-Tolerant MPI over Infiniband.
Alexandre DenisFrançois TrahayYutaka IshikawaPublished in: EuroMPI (2012)
Keyphrases
- fault tolerant
- fault tolerance
- parallel computers
- distributed systems
- parallel implementation
- high performance computing
- general purpose
- state machine
- message passing
- high availability
- parallel algorithm
- parallel computing
- load balancing
- shared memory
- high assurance
- interconnection networks
- distributed memory
- parallel computation
- massively parallel
- data management
- response time