A sandpile model for reliable actor-critic reinforcement learning.

Yiming Peng Gang Chen Mengjie Zhang Shaoning Pang

Published in: IJCNN (2017)

Keyphrases

reinforcement learning
model free
learning algorithm
objective function
function approximation
temporal difference
neural network
fuzzy logic
least squares
mathematical model
actor critic