Sign in

A Hybrid Evolving and Gradient Strategy for Approximating Policy Evaluation on Online Critic-Actor Learning.

Jian FuHaibo HeHuiying LiQing Liu
Published in: ISNN (1) (2012)
Keyphrases
  • learning algorithm
  • policy gradient
  • reinforcement learning
  • learning process
  • least squares
  • function approximation
  • temporal difference
  • belief revision