User-Level Socket-Based Checkpointing for Distributed and Parallel Computation
Jason Ansel, Michael Rieker, Gene Cooperman
Published in: CoRR (2007)
Keyphrases
- parallel computation
- distributed systems
- map reduce
- parallel algorithm
- parallel computing
- parallel implementation
- distributed databases
- distributed database systems
- parallel processing
- shared memory
- parallel programming
- low overhead
- integral image
- failure recovery
- user interface
- fault tolerance
- fault tolerant
- distributed environment
- end users