Active Model Estimation in Markov Decision Processes.
Jean Tarbouriech
Shubhanshu Shekhar
Matteo Pirotta
Mohammad Ghavamzadeh
Alessandro Lazaric
Published in:
UAI (2020)
Keyphrases
Markov decision processes
probabilistic model
optimal policy
machine learning
reinforcement learning
objective function
dynamic programming