A sandpile model for reliable actor-critic reinforcement learning.

Yiming Peng, Gang Chen, Mengjie Zhang, Shaoning Pang
Published in: IJCNN (2017)
Keyphrases
  • reinforcement learning
  • model free
  • learning algorithm
  • objective function
  • function approximation
  • temporal difference
  • neural network
  • fuzzy logic
  • least squares
  • mathematical model
  • actor critic