Evaluating an XOR-based Hybrid Fault Tolerance Technique to Detect Faults in GPU Pipelines.
Giani Augusto BragaMarcio M. GonçalvesJosé Rodrigo AzambujaPublished in: ISVLSI (2023)
Keyphrases
- fault tolerance
- fault tolerant
- error detection
- distributed systems
- load balancing
- distributed computing
- high availability
- group communication
- response time
- replicated databases
- peer to peer
- mobile agents
- database replication
- fault management
- intelligent agents
- high performance computing
- failure recovery
- fault diagnosis
- database management systems
- parallel computing
- data collection
- data streams
- data replication
- metadata
- artificial intelligence
- databases