MMPI: A Scalable Fault Tolerance Mechanism for MPI Large Scale Parallel Computing.
Zhiyuan Wang, Xuejun Yang, Yun Zhou
Published in: CIT (2010)
Keyphrases
- parallel computing
- fault tolerance
- high scalability
- fault tolerant
- high performance computing
- massively parallel
- commodity hardware
- distributed systems
- shared memory
- load balancing
- peer to peer
- response time
- database replication
- replicated databases
- processing units
- parallel programming
- parallel computers
- distributed computing
- mobile agents
- mobile agent system
- message passing interface
- parallel computation
- computing systems
- computer architecture
- parallel machines
- parallel execution
- fault management
- map reduce
- parallel algorithm
- general purpose
- multithreading
- field programmable gate array
- image segmentation