Application-Level Correctness and its Impact on Fault Tolerance.
Xuanhua LiDonald YeungPublished in: HPCA (2007)
Keyphrases
- fault tolerance
- application level
- fault tolerant
- distributed computing
- distributed systems
- operating system
- load balancing
- overlay network
- peer to peer
- quality of service
- response time
- high availability
- virtual machine
- network management
- database replication
- group communication
- node failures
- single point of failure
- bottle neck
- mobile agents
- replicated databases
- artificial intelligence
- ad hoc networks
- error detection
- sensor nodes
- data replication
- routing protocol
- failure recovery
- fault management
- sensor networks
- data sets