Proactive fault tolerance for HPC with Xen virtualization.
Arun Babu NagarajanFrank MuellerChristian EngelmannStephen L. ScottPublished in: ICS (2007)
Keyphrases
- fault tolerance
- high availability
- distributed computing
- fault tolerant
- high performance computing
- virtual machine
- distributed systems
- response time
- operating system
- load balancing
- group communication
- peer to peer
- mobile agents
- cloud computing
- replicated databases
- database replication
- failure recovery
- grid computing
- context aware
- fault management
- node failures
- replica control
- artificial intelligence
- data replication
- distributed environment
- multi agent systems
- knowledge base