Login / Signup
Finite-Sample Analysis of Off-Policy Natural Actor-Critic With Linear Function Approximation.
Zaiwei Chen
Sajad Khodadadian
Siva Theja Maguluri
Published in:
IEEE Control. Syst. Lett. (2022)
Keyphrases
</>
function approximation
natural actor critic
reinforcement learning
finite sample
radial basis function
temporal difference
decision trees
text classification
learning tasks
function approximators