An Architecture for Unattended Containerized (Deep) Reinforcement Learning with Webots.

Tobias Haubold, Petra Linke

Published in: CoRR (2024)

Keyphrases

reinforcement learning
function approximation
monitoring system
optimal policy
model-free
multi-agent
Markov decision processes
machine learning
learning algorithm
learning process
action space
state space
temporal difference
temporal difference learning
deep learning
resource-constrained
transfer learning
optimal control
reinforcement learning algorithms
learning capabilities
mobile robot
dynamic programming
active learning
learning agents
database