Login / Signup
Non-Deterministic Policies in Markovian Decision Processes.
Mahdi Milani Fard
Joelle Pineau
Published in:
CoRR (2014)
Keyphrases
</>
markovian decision processes
markov decision problems
optimal policy
monte carlo
black box
reinforcement learning
learning algorithm
lower bound
multi objective
stationary policies