A Checkpointing Strategy for Scalable Recovery on Distributed Parallel Systems.
Vijay K. Naik, Samuel P. Midkiff, José E. Moreira
Published in: SC (1997)
Keyphrases
- distributed systems
- data intensive
- distributed database systems
- load balancing strategies
- load balancing strategy
- load balancing
- failure recovery
- computer systems
- lightweight
- distributed environment
- intelligent systems
- cooperative
- multi agent
- load balance
- parallel execution
- high end
- fault tolerant
- distributed databases
- retrieval systems
- parallel computing
- massively parallel
- management system
- distributed processing
- message passing
- mission critical
- master slave
- low overhead
- high scalability
- autonomous mobile
- complex systems
- peer to peer