Login / Signup
A stopping rule for linear stochastic approximation.
Takayuki Wada
Takamitsu Itani
Yasumasa Fujisaki
Published in:
CDC (2010)
Keyphrases
</>
stochastic approximation
monte carlo
reinforcement learning
temporal difference learning
neural network
support vector machine