Oracle-Efficient Reinforcement Learning in Factored MDPs with Unknown Structure.
Aviv Rosenberg
Yishay Mansour
Published in:
CoRR (2020)
Keyphrases
factored MDPs
reinforcement learning
Markov decision processes
state space
approximate dynamic programming
machine learning
context specific
learning algorithm
optimal policy
hidden Markov models
learning capabilities
policy iteration