An Architecture for Unattended Containerized (Deep) Reinforcement Learning with Webots.
Tobias Haubold, Petra Linke
Published in: CoRR (2024)
Keyphrases
- reinforcement learning
- function approximation
- monitoring system
- optimal policy
- model free
- multi agent
- Markov decision processes
- machine learning
- learning algorithm
- learning process
- action space
- state space
- temporal difference
- temporal difference learning
- deep learning
- resource constrained
- transfer learning
- optimal control
- reinforcement learning algorithms
- learning capabilities
- mobile robot
- dynamic programming
- active learning
- learning agents
- database