Login / Signup
Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence.
Wenhao Zhan
Shicong Cen
Baihe Huang
Yuxin Chen
Jason D. Lee
Yuejie Chi
Published in:
SIAM J. Optim. (2023)
Keyphrases
</>
reinforcement learning
optimal policy
machine learning
state space
neural network
main contribution
theoretical framework
risk minimization
least squares
action selection