Network Fault Tolerance in Open MPI.
Galen M. ShipmanRichard L. GrahamGeorge BosilcaPublished in: Euro-Par (2007)
Keyphrases
- fault tolerance
- fault tolerant
- peer to peer
- fault management
- high performance computing
- single point of failure
- component failures
- distributed systems
- load balancing
- distributed computing
- response time
- group communication
- high availability
- node failures
- replicated databases
- wireless sensor
- mobile agents
- failure recovery
- error detection
- network management
- parallel implementation
- computer networks
- network traffic
- wireless sensor networks
- data replication
- overlay network
- massively parallel
- network resources
- data sets
- parallel computing
- data availability
- communication networks
- mobile agent system
- anomaly detection
- artificial intelligence