Legio: Fault Resiliency for Embarrassingly Parallel MPI Applications.
Roberto RoccoDavide GadioliGianluca PalermoPublished in: CoRR (2021)
Keyphrases
- shared memory
- parallel implementation
- parallelization strategy
- message passing interface
- distributed memory
- parallel computing
- parallel programming
- fault diagnosis
- message passing
- parallel algorithm
- parallel processing
- fault detection
- parallel architecture
- massively parallel
- high performance computing
- parallel computers
- database
- neural network
- databases
- real time
- parallel machines
- parallel computation
- scheduling problem
- general purpose
- computer systems
- parallel execution
- multiple faults
- parallel hardware