From self-tuning regulators to reinforcement learning and back again.
Nikolai MatniAlexandre ProutièreAnders RantzerStephen TuPublished in: CDC (2019)
Keyphrases
- reinforcement learning
- function approximation
- state space
- machine learning
- model free
- reinforcement learning algorithms
- markov decision processes
- temporal difference
- supervised learning
- transfer learning
- robotic control
- temporal difference learning
- pid controller
- optimal policy
- optimal control
- co occurrence
- genetic algorithm
- data mining
- real world
- autonomous learning
- multi agent reinforcement learning
- database