: A System for Automating Application-Level Checkpointing of MPI Programs.
Greg BronevetskyDaniel MarquesKeshav PingaliPaul StodghillPublished in: LCPC (2003)
Keyphrases
- application level
- operating system
- distributed databases
- network services
- network management
- quality of service
- message passing
- distributed systems
- overlay network
- virtual machine
- parallel implementation
- shared memory
- bottle neck
- low overhead
- information technology
- programming environment
- parallel algorithm
- main memory databases
- distributed database systems
- high performance computing
- fault tolerance
- parallelization strategy
- low cost