Matrix Multiplication on GPUs with On-Line Fault Tolerance.
Chong DingChrister KarlssonHui LiuTeresa DaviesZizhong ChenPublished in: ISPA (2011)
Keyphrases
- fault tolerance
- matrix multiplication
- fault tolerant
- distributed systems
- message passing
- distributed computing
- response time
- high availability
- replicated databases
- load balancing
- distributed memory
- group communication
- peer to peer
- mobile agents
- matrix factorization
- high performance computing
- fault management
- database replication
- component failures
- data processing
- multi view
- single point of failure