Supporting Cost-Effective Fault Tolerance in Distributed Message-Passing Applications with File Operations.
Jinsong Ouyang, Piyush Maheshwari
Published in: J. Supercomput. (1999)
Keyphrases
- message passing
- cost effective
- distributed systems
- fault tolerance
- fault tolerant
- matrix multiplication
- distributed computing
- distributed environment
- load balancing
- low cost
- high availability
- database replication
- mobile agents
- group communication
- factor graphs
- sum product algorithm
- peer to peer
- cost effectiveness
- single point of failure
- shared memory
- data replication
- inference in graphical models
- error detection
- replicated databases
- approximate inference
- sum product
- high performance computing
- grid computing
- failure recovery
- file system
- data center
- belief propagation
- data sets