Login / Signup
A sandpile model for reliable actor-critic reinforcement learning.
Yiming Peng
Gang Chen
Mengjie Zhang
Shaoning Pang
Published in:
IJCNN (2017)
Keyphrases
</>
reinforcement learning
model free
learning algorithm
objective function
function approximation
temporal difference
neural network
fuzzy logic
least squares
mathematical model
actor critic