Transparent Fault Tolerance for Parallel Applications on Networks of Workstations.
Daniel J. ScalesMonica S. LamPublished in: USENIX Annual Technical Conference (1996)
Keyphrases
- fault tolerance
- fault tolerant
- load balancing
- node failures
- high availability
- distributed computing
- distributed systems
- distributed memory
- response time
- group communication
- peer to peer
- parallel implementation
- error detection
- replicated databases
- database replication
- failure recovery
- mobile agents
- single point of failure
- fault management
- massively parallel
- sensor networks
- shared memory
- data replication
- high scalability
- high performance computing
- sensor nodes
- computational intelligence