-policy and stochastic restarting scheme.
Qihui Bu
Yun Sun
Xudong Chai
Liwei Liu
Published in:
Appl. Math. Comput. (2020)
Keyphrases
optimal policy
detection scheme
monte carlo
learning scheme
state dependent
approximation schemes
learning automaton
model free reinforcement learning
neural network
reinforcement learning
stochastic model
learning automata