Fault Tolerance using Reinforcement Learning for Cloud Resource Management: Fault Tolerance using RL for Cloud Resource Management.
Prathamesh Vijay LahandeParag Ravikant KaveriPublished in: IC3 (2023)
Keyphrases
- resource management
- fault tolerance
- reinforcement learning
- fault tolerant
- resource allocation
- management system
- cloud computing
- response time
- distributed computing
- distributed systems
- load balancing
- high availability
- quality of service
- computing resources
- intelligent agents
- mobile agents
- peer to peer
- grid computing
- distributed query processing
- optimal policy
- resource utilization
- data center
- service level agreements
- expert systems
- failure recovery
- model free
- high performance computing
- fault management
- database replication
- data management
- learning algorithm
- network resources
- control policy
- computer systems
- artificial intelligence
- machine learning