A HPC Co-Scheduler with Reinforcement Learning.

Abel Souza, Kristiaan Pelckmans, Johan Tordsson

Published in: CoRR (2024)

Keyphrases

reinforcement learning
high performance computing
function approximation
model free
reinforcement learning algorithms
temporal difference
robotic control
multi agent reinforcement learning
fault tolerance
state space
scheduling algorithm
multi agent
scientific computing
neural network
Markov decision processes
temporal difference learning
fault tolerant
action selection
dynamic programming
learning algorithm
reinforcement learning methods
policy search
real time