A Counterexample to Temporal Differences Learning.

Dimitri P. Bertsekas
Published in: Neural Comput. (1995)
Keyphrases
  • learning process
  • learning algorithm
  • temporal difference
  • reinforcement learning
  • learning tasks
  • TD learning
  • multiresolution
  • prior knowledge
  • Monte Carlo