A Codesigned Fault Tolerance System for Heterogeneous Many-Core Processors.
Keun Soo YimRavishankar K. IyerPublished in: IPDPS Workshops (2011)
Keyphrases
- fault tolerance
- fault tolerant
- high performance computing
- distributed systems
- high availability
- load balancing
- response time
- parallel algorithm
- group communication
- peer to peer
- distributed computing
- database replication
- replicated databases
- fault management
- single point of failure
- data replication
- mobile agents
- shared memory
- failure recovery
- database
- error detection
- component failures
- parallel execution
- parallel computing
- intelligent agents
- operating system
- digital libraries
- knowledge base