Login / Signup
Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence.
Wenhao Zhan
Shicong Cen
Baihe Huang
Yuxin Chen
Jason D. Lee
Yuejie Chi
Published in:
CoRR (2021)
Keyphrases
</>
reinforcement learning
main contribution
optimal policy
neural network
multi agent systems
machine learning
supervised learning
sequential decision making