Pseudorehearsal in value function approximation.

Vladimir Marochko Leonard Johard Manuel Mazzara

Published in: CoRR (2017)

Keyphrases

basis functions
approximate dynamic programming
state space
temporal difference
linear combination
temporal difference learning
genetic algorithm
data sets
reinforcement learning
artificial neural networks
state action