Login / Signup

Bellman's principle of optimality and deep reinforcement learning for time-varying tasks.

Alessandro GiuseppiAntonio Pietrabissa
Published in: Int. J. Control (2022)
Keyphrases
  • reinforcement learning
  • transfer learning
  • learning process
  • multi agent
  • learning algorithm
  • optimal solution
  • optimal policy
  • model free
  • reinforcement learning algorithms
  • temporal difference learning