Login / Signup
Consistency of HDP applied to a simple reinforcement learning problem.
Paul J. Werbos
Published in:
Neural Networks (1990)
Keyphrases
</>
reinforcement learning
information systems
dynamic programming
learning algorithm
bayesian networks
multi agent
markov decision processes
function approximation