Sign in

Enhancing the episodic natural actor-critic algorithm by a regularisation term to stabilize learning of control structures.

Andreas WitschRoland ReichleKurt GeihsSascha LangeMartin A. Riedmiller
Published in: ADPRL (2011)
Keyphrases
  • learning algorithm
  • control structures
  • learning process
  • monte carlo
  • neural network
  • image processing
  • least squares
  • supervised learning
  • natural actor critic