Login / Signup
A stopping rule for simultaneous perturbation stochastic approximation.
Takayuki Wada
Yasumasa Fujisaki
Published in:
ECC (2013)
Keyphrases
</>
stochastic approximation
monte carlo
multi start
temporal difference learning
reinforcement learning
dynamic programming
theoretical guarantees
decision making
learning problems