Optimistic Initialization and Greediness Lead to Polynomial Time Learning in Factored MDPs - Extended Version
Istvan Szita
András Lőrincz
Published in:
CoRR (2009)
Keyphrases
learning algorithm
learning process
learning tasks
decision making
reinforcement learning
special case
Markov chain
Markov decision processes
factored MDPs