From self-tuning regulators to reinforcement learning and back again.

Nikolai Matni, Alexandre Proutière, Anders Rantzer, Stephen Tu

Published in: CDC (2019)

Keyphrases

reinforcement learning
function approximation
state space
machine learning
model-free
reinforcement learning algorithms
Markov decision processes
temporal difference
supervised learning
transfer learning
robotic control
temporal difference learning
PID controller
optimal policy
optimal control
co-occurrence
genetic algorithm
data mining
real world
autonomous learning
multi-agent reinforcement learning
database