Login / Signup
Model, Data and Reward Repair: Trusted Machine Learning for Markov Decision Processes.
Shalini Ghosh
Susmit Jha
Ashish Tiwari
Patrick Lincoln
Xiaojin Zhu
Published in:
DSN Workshops (2018)
Keyphrases
</>
markov decision processes
machine learning
reinforcement learning
probability distribution
dynamic programming
probabilistic model
objective function
optimal policy
function approximation
infinite horizon
learning algorithm
decision theoretic
model free
planning under uncertainty
decision theoretic planning