Job Migration and Fault Tolerance in SLA-Aware Resource Management Systems.
Dominic Battré, Matthias Hovestadt, Odej Kao, Axel Keller, Kerstin Voß
Published in: GPC Workshops (2008)
Keyphrases
- fault tolerance
- management system
- fault tolerant
- resource management
- distributed computing
- service level agreements
- high availability
- distributed systems
- response time
- load balancing
- peer to peer
- database replication
- quality of service
- replicated databases
- service providers
- database management systems
- group communication
- resource allocation
- mobile agents
- error detection
- cloud computing
- fault management
- component failures
- failure recovery
- data replication
- grid environment
- high performance computing
- grid computing
- data collection
- computing resources
- single point of failure