Login / Signup

Online Model-Free RLSPI Algorithm for Nonlinear Discrete-Time Non-affine Systems.

Yuanheng ZhuDongbin Zhao
Published in: ICONIP (2) (2013)
Keyphrases
  • model free
  • dynamic programming
  • learning algorithm
  • machine learning
  • monte carlo
  • linear systems
  • search space
  • linear programming
  • temporal difference
  • reinforcement learning algorithms