Machine fault tolerance for reliable datacenter systems.
Danyang ZhuoQiao ZhangDan R. K. PortsArvind KrishnamurthyThomas E. AndersonPublished in: APSys (2014)
Keyphrases
- fault tolerance
- distributed systems
- fault tolerant
- single point of failure
- fault management
- distributed computing
- response time
- load balancing
- high scalability
- cooperative
- management system
- peer to peer
- intelligent systems
- data sets
- replicated databases
- group communication
- knowledge based systems
- knowledge base
- computer systems
- expert systems
- multi agent
- database replication
- database