A Channel Memory based fault tolerance for MPI applications.
Anton SelikhovC. GermaiPublished in: Future Gener. Comput. Syst. (2005)
Keyphrases
- fault tolerance
- high performance computing
- fault tolerant
- distributed systems
- load balancing
- message passing
- distributed computing
- response time
- group communication
- high availability
- multi channel
- peer to peer
- mobile agents
- replicated databases
- parallel algorithm
- shared memory
- parallel implementation
- fault management
- failure recovery
- databases
- component failures
- data replication
- high scalability
- sensor nodes
- single point of failure
- wireless sensor networks