A Hybrid Evolving and Gradient Strategy for Approximating Policy Evaluation on Online Critic-Actor Learning.
Jian Fu
Haibo He
Huiying Li
Qing Liu
Published in:
ISNN (1) (2012)
Keyphrases
learning algorithm
policy gradient
reinforcement learning
learning process
least squares
function approximation
temporal difference
belief revision