An Efficient Off-Policy Reinforcement Learning Algorithm for the Continuous-Time LQR Problem.

Victor G. Lopez, Matthias Albrecht Müller
Published in: CDC (2023)
Keyphrases
  • image segmentation
  • optimal control
  • markov chain
  • control system
  • markov processes
  • face recognition
  • wide range
  • dynamic programming
  • database
  • data sets
  • machine learning
  • information retrieval
  • dynamical systems