Checkpointing Orchestration: Toward a Scalable HPC Fault-Tolerant Environment.
Hui Jin
Tao Ke
Yong Chen
Xian-He Sun
Published in:
CCGRID (2012)
Keyphrases
fault tolerance
fault tolerant
distributed systems
high performance computing
load balancing
high availability
distributed computing
failure recovery
response time
error detection
mobile robot
data structure
data streams
safety critical
mobile agent system