Architecting Peer-to-Peer Serverless Distributed Machine Learning Training for Improved Fault Tolerance.
Amine Barrak, Fábio Petrillo, Fehmi Jaafar
Published in: CoRR (2023)
Keyphrases
- fault tolerance
- peer to peer
- machine learning
- distributed computing
- fault tolerant
- load balancing
- distributed systems
- distributed environment
- group communication
- high availability
- peer to peer networks
- single point of failure
- overlay network
- distributed query processing
- database replication
- replicated databases
- grid computing
- high scalability
- failure recovery
- fault management
- error detection
- video streaming
- mobile agents
- digital libraries
- file sharing
- ad hoc networks
- knowledge acquisition
- databases
- artificial intelligence
- high performance computing
- cooperative
- learning algorithm
- knowledge representation
- wireless sensor networks
- data analysis
- node failures