Sign in

Simultaneous perturbation stochastic approximation: towards one-measurement per iteration.

Shiru LiYong XiaZi Xu
Published in: Numer. Algorithms (2023)
Keyphrases
  • stochastic approximation
  • monte carlo
  • multi start
  • reinforcement learning
  • search algorithm
  • temporal difference learning