Login / Signup
On Wasserstein Reinforcement Learning and the Fokker-Planck equation.
Pierre H. Richemond
Brendan Maginnis
Published in:
CoRR (2017)
Keyphrases
</>
reinforcement learning
function approximation
pointwise
learning algorithm
state space
markov decision processes
dynamic programming
markov chain
machine learning
object recognition
probability distribution
three dimensional
x ray
segmentation algorithm
optimal policy
computer vision
fokker planck equation