Calculus on MDPs: Potential Shaping as a Gradient.

Erik Jenner, Herke van Hoof, Adam Gleave
Published in: CoRR (2022)
Keyphrases
  • markov decision processes
  • reinforcement learning
  • state space
  • image segmentation
  • optimal policy
  • gradient information
  • neural network
  • multiscale
  • dynamic programming
  • factored mdps
  • state and action spaces