Fault Tolerance in Iterative-Convergent Machine Learning.
Aurick QiaoBryon AragamBingjing ZhangEric P. XingPublished in: CoRR (2018)
Keyphrases
- fault tolerance
- machine learning
- fault tolerant
- distributed systems
- load balancing
- distributed computing
- response time
- replicated databases
- high availability
- peer to peer
- group communication
- high performance computing
- database replication
- artificial intelligence
- high scalability
- error detection
- single point of failure
- fault management
- mobile agents
- knowledge acquisition
- data replication
- distributed query processing
- failure recovery
- databases
- computational intelligence
- multi agent
- reinforcement learning