Login / Signup
Generalized Simultaneous Perturbation Stochastic Approximation with Reduced Estimator Bias.
Shalabh Bhatnagar
Prashanth L. A.
Published in:
CoRR (2022)
Keyphrases
</>
stochastic approximation
monte carlo
variance reduction
least squares
temporal difference learning
neural network
reinforcement learning
policy iteration
particle filter
probabilistic model