Login / Signup

Taylor TD-learning.

Michele GaribboMaxime RobeynsLaurence Aitchison
Published in: CoRR (2023)
Keyphrases
  • td learning
  • temporal difference
  • evaluation function
  • function approximation
  • reinforcement learning
  • multi step
  • policy evaluation
  • reinforcement learning algorithms
  • model free
  • neural network
  • monte carlo
  • step size