Fault tolerant MapReduce-MPI for HPC clusters.
Yanfei Guo, Wesley Bland, Pavan Balaji, Xiaobo Zhou
Published in: SC (2015)
Keyphrases
- fault tolerant
- fault tolerance
- high performance computing
- distributed computing
- message passing interface
- distributed systems
- parallel computing
- load balancing
- clustering algorithm
- high availability
- grid computing
- parallel programming
- data points
- cloud computing
- state machine
- computer architecture
- message passing
- database
- parallel algorithm
- artificial intelligence
- parallel implementation
- parallel processing
- interconnection networks