A HPC Co-Scheduler with Reinforcement Learning.
Abel SouzaKristiaan PelckmansJohan TordssonPublished in: CoRR (2024)
Keyphrases
- reinforcement learning
- high performance computing
- function approximation
- model free
- reinforcement learning algorithms
- temporal difference
- robotic control
- multi agent reinforcement learning
- fault tolerance
- state space
- scheduling algorithm
- multi agent
- scientific computing
- neural network
- markov decision processes
- temporal difference learning
- fault tolerant
- action selection
- dynamic programming
- learning algorithm
- reinforcement learning methods
- policy search
- real time