Fault tolerance in heterogeneous multi-cluster systems through a task migration mechanism.
Uriel Cabello, José Rodríguez, Amilcar Meneses, Sonia Mendoza, Dominique Decouchant
Published in: CCE (2014)
Keyphrases
- fault tolerance
- fault tolerant
- distributed systems
- single point of failure
- fault management
- distributed computing
- computer systems
- high availability
- replicated databases
- group communication
- response time
- load balancing
- high performance computing
- data management
- data sets
- high scalability
- failure recovery
- database replication
- expert systems