Login / Signup
Pseudorehearsal in value function approximation.
Vladimir Marochko
Leonard Johard
Manuel Mazzara
Published in:
CoRR (2017)
Keyphrases
</>
basis functions
approximate dynamic programming
state space
temporal difference
linear combination
temporal difference learning
genetic algorithm
data sets
reinforcement learning
artificial neural networks
state action