Optimistic Natural Policy Gradient: a Simple Efficient Policy Optimization Framework for Online RL.
Qinghua Liu
Gellért Weisz
András György
Chi Jin
Csaba Szepesvári
Published in:
NeurIPS (2023)
Keyphrases
policy gradient
parametric optimization
actor critic
reinforcement learning
optimal policy
policy search
function approximation
optimal control
model free reinforcement learning
reinforcement learning algorithms
optimization algorithm
model free
action selection
gradient method
approximate dynamic programming