Discrete stochastic approximation via simultaneous difference approximations.

Stacy D. Hill, László Gerencsér, Zsuzsanna Vágó
Published in: ACC (2005)
Keyphrases
  • stochastic approximation
  • Monte Carlo
  • policy iteration
  • finite number
  • reinforcement learning
  • Markov decision processes
  • optimal policy
  • temporal difference learning