Calculus on MDPs: Potential Shaping as a Gradient.

Erik Jenner, Herke van Hoof, Adam Gleave

Published in: CoRR (2022)

Keyphrases

Markov decision processes
reinforcement learning
state space
image segmentation
optimal policy
gradient information
neural network
multiscale
dynamic programming
factored MDPs
state and action spaces