Login / Signup
Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm.
Sajad Khodadadian
Zaiwei Chen
Siva Theja Maguluri
Published in:
CoRR (2021)
Keyphrases
</>
dynamic programming
error bounds
natural actor critic
learning algorithm
objective function
expectation maximization
finite sample
theoretical analysis
convergence rate
optimal solution
np hard
sample size
robot arm