A stopping rule for linear stochastic approximation.

Takayuki Wada, Takamitsu Itani, Yasumasa Fujisaki
Published in: CDC (2010)
Keyphrases
  • stochastic approximation
  • Monte Carlo
  • reinforcement learning
  • temporal difference learning
  • neural network
  • support vector machine