Login / Signup

A stopping rule for simultaneous perturbation stochastic approximation.

Takayuki WadaYasumasa Fujisaki
Published in: ECC (2013)
Keyphrases
  • stochastic approximation
  • monte carlo
  • multi start
  • temporal difference learning
  • reinforcement learning
  • dynamic programming
  • theoretical guarantees
  • decision making
  • learning problems