Datapath fault tolerance for parallel accelerators.
James J. DavisPeter Y. K. CheungPublished in: FPT (2013)
Keyphrases
- massively parallel
- fault tolerance
- fault tolerant
- parallel computing
- distributed systems
- distributed computing
- response time
- high availability
- load balancing
- fault management
- mobile agents
- error detection
- replicated databases
- peer to peer
- computing platform
- group communication
- failure recovery
- database replication
- data replication
- multi agent systems
- high scalability
- replica control