A Technique for Fault Tolerance Assessment of COTS Based Systems.
Ruben AlexanderssonD. Krishna ChaitanyaPeter ÖhmanYasir SirajPublished in: SAFECOMP (2005)
Keyphrases
- fault tolerance
- fault tolerant
- distributed systems
- fault management
- single point of failure
- load balancing
- group communication
- replicated databases
- response time
- distributed computing
- high scalability
- peer to peer
- mobile agents
- management system
- high performance computing
- high availability
- computer systems
- sensor data
- expert systems
- database replication
- reinforcement learning