Sign in

Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence.

Wenhao ZhanShicong CenBaihe HuangYuxin ChenJason D. LeeYuejie Chi
Published in: SIAM J. Optim. (2023)
Keyphrases
  • reinforcement learning
  • optimal policy
  • machine learning
  • state space
  • neural network
  • main contribution
  • theoretical framework
  • risk minimization
  • least squares
  • action selection