Concentration bounds for two time scale stochastic approximation.

Vivek S. Borkar, Sarath Pattathil
Published in: Allerton (2018)
Keyphrases
  • stochastic approximation
  • Monte Carlo
  • upper bound
  • multi start
  • theoretical guarantees
  • lower bound
  • worst case
  • reinforcement learning
  • dynamic programming
  • model free
  • policy iteration