Login / Signup
An Efficient Off-Policy Reinforcement Learning Algorithm for the Continuous-Time LQR Problem.
Victor G. Lopez
Matthias Albrecht Müller
Published in:
CDC (2023)
Keyphrases
</>
image segmentation
optimal control
markov chain
control system
markov processes
face recognition
wide range
dynamic programming
database
data sets
machine learning
information retrieval
dynamical systems