
An Efficient Off-Policy Reinforcement Learning Algorithm for the Continuous-Time LQR Problem.

Victor G. Lopez, Matthias Albrecht Müller
Published in: CDC (2023)
Keyphrases
  • image segmentation
  • optimal control
  • markov chain
  • control system
  • markov processes
  • face recognition
  • wide range
  • dynamic programming
  • database
  • data sets
  • machine learning
  • information retrieval
  • dynamical systems