Login / Signup
Beyond Confidence Regions: Tight Bayesian Ambiguity Sets for Robust MDPs.
Marek Petrik
Reazul Hasan Russel
Published in:
NeurIPS (2019)
Keyphrases
</>
markov decision processes
confidence level
lower bound
dynamic programming
upper bound
worst case
maximum likelihood
optimal policy
markov decision process