A user-level library for fault tolerance on shared memory multicore systems.
Hamid MushtaqZaid Al-ArsKoen BertelsPublished in: DDECS (2012)
Keyphrases
- shared memory
- fault tolerance
- distributed systems
- message passing
- fault tolerant
- heterogeneous platforms
- single point of failure
- load balancing
- fault management
- parallel algorithm
- distributed memory
- parallel computing
- computer systems
- parallel programming
- database replication
- operating system
- distributed computing
- address space
- response time
- parallel architectures
- high performance computing
- high end
- intelligent systems
- mobile agents