Sign in

Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies.

Bin HuKaiqing ZhangNa LiMehran MesbahiMaryam FazelTamer Basar
Published in: CoRR (2022)
Keyphrases
  • theoretical foundation
  • control policies
  • learning algorithm
  • reinforcement learning
  • finite number