-policy and stochastic restarting scheme.

Qihui Bu, Yun Sun, Xudong Chai, Liwei Liu

Published in: Appl. Math. Comput. (2020)

Keyphrases

optimal policy
detection scheme
Monte Carlo
learning scheme
state-dependent
approximation schemes
learning automaton
model-free reinforcement learning
neural network
reinforcement learning
stochastic model
learning automata