Toward a Theoretical Foundation of Policy Optimization for Learning Control Policies.
Bin Hu
Kaiqing Zhang
Na Li
Mehran Mesbahi
Maryam Fazel
Tamer Basar
Published in:
Annu. Rev. Control. Robotics Auton. Syst. (2023)
Keyphrases
autonomous robots
theoretical foundation
control policies
theoretical framework
reinforcement learning
optimization problems
optimal policy
mathematical model
continuous state
real time
learning algorithm
linear programming
finite horizon