Mirage: Towards Low-interruption Services on Batch GPU Clusters with Reinforcement Learning.
Qiyang Ding, Pengfei Zheng, Shreyas Kudari, Shivaram Venkataraman, Zhao Zhang
Published in: CoRR (2023)
Keyphrases
- reinforcement learning
- web services
- clustering algorithm
- service oriented
- real time
- cluster analysis
- service providers
- service discovery
- learning algorithm
- batch mode
- service quality
- data points
- state space
- highly correlated
- fuzzy clustering
- Markov decision processes
- hierarchical clustering
- optimal control
- information services
- mobile applications
- service delivery
- graphics hardware
- data clustering
- multi agent
- context aware
- learning process
- self organizing maps
- high dimensional
- machine learning
- digital libraries
- reinforcement learning algorithms
- feature space
- parallel implementation
- subspace clustering
- web technologies
- service oriented architecture
- service composition
- dynamic programming
- function approximation
- optimal policy