Login / Signup
Increasingly Cautious Optimism for Practical PAC-MDP Exploration.
Liangpeng Zhang
Ke Tang
Xin Yao
Published in:
IJCAI (2015)
Keyphrases
</>
markov decision processes
state space
data sets
real world
upper bound
sample complexity
markov decision process
decision trees
active learning
special case
utility function
initial state
exploration strategy