Linear Convergence of Entropy-Regularized Natural Policy Gradient with Linear Function Approximation.

Semih Cayci, Niao He, R. Srikant

Published in: CoRR (2021)

Keyphrases

function approximation
reinforcement learning
function approximators
policy gradient
decision trees
least squares
learning tasks
convergence rate
multi agent systems
state space
temporal difference
reinforcement learning algorithms