Login / Signup
Linear Convergence of Entropy-Regularized Natural Policy Gradient with Linear Function Approximation.
Semih Cayci
Niao He
R. Srikant
Published in:
CoRR (2021)
Keyphrases
</>
function approximation
reinforcement learning
function approximators
policy gradient
decision trees
least squares
learning tasks
convergence rate
multi agent systems
state space
temporal difference
reinforcement learning algorithms