An online hyper-volume action bounding approach for accelerating the process of deep reinforcement learning from multiple controllers.

Ali Aflakian, Alireza Rastegarpanah, Jamie Hathaway, Rustam Stolkin
Published in: J. Field Robotics (2024)
Keyphrases
  • reinforcement learning
  • real time
  • machine learning
  • dynamic programming
  • upper bound
  • online learning
  • Markov decision processes
  • development process
  • transition model