Login / Signup
Finite-Time Analysis of Entropy-Regularized Neural Natural Actor-Critic Algorithm.
Semih Cayci
Niao He
R. Srikant
Published in:
CoRR (2022)
Keyphrases
</>
learning rate
convergence rate
learning algorithm
objective function
dynamic programming
optimal solution
search space
cost function
np hard
neural network
least squares
linear programming
natural actor critic