Login / Signup
An online hyper-volume action bounding approach for accelerating the process of deep reinforcement learning from multiple controllers.
Ali Aflakian
Alireza Rastegarpanah
Jamie Hathaway
Rustam Stolkin
Published in:
J. Field Robotics (2024)
Keyphrases
</>
reinforcement learning
real time
machine learning
dynamic programming
upper bound
online learning
markov decision processes
development process
transition model