Finite-Sample Analysis of Off-Policy Natural Actor-Critic with Linear Function Approximation.
Zaiwei Chen
Sajad Khodadadian
Siva Theja Maguluri
Published in:
CoRR (2021)
Keyphrases
function approximation
natural actor critic
reinforcement learning
robot arm
finite sample
neural network
feature selection
decision trees
feature space
radial basis function
temporal difference