Online Model-Free RLSPI Algorithm for Nonlinear Discrete-Time Non-affine Systems.

Yuanheng Zhu Dongbin Zhao

Published in: ICONIP (2) (2013)

Keyphrases

model free
dynamic programming
learning algorithm
machine learning
monte carlo
linear systems
search space
linear programming
temporal difference
reinforcement learning algorithms