Network Fault Tolerance in LA-MPI.
Rob T. AulwesDavid J. DanielNehal N. DesaiRichard L. GrahamL. Dean RisingerMitchel W. SukalskiMark A. TaylorPublished in: PVM/MPI (2003)
Keyphrases
- fault tolerance
- fault tolerant
- peer to peer
- fault management
- high performance computing
- single point of failure
- load balancing
- high availability
- distributed computing
- component failures
- distributed systems
- response time
- mobile agents
- database replication
- wireless sensor networks
- network structure
- error detection
- group communication
- failure recovery
- network traffic
- node failures
- sensor nodes
- wireless sensor
- replicated databases
- replica control
- parallel algorithm
- message passing
- communication networks
- data replication
- network management
- distributed environment