Login / Signup
Combining Off and On-Policy Training in Model-Based Reinforcement Learning.
Alexandre Borges
Arlindo L. Oliveira
Published in:
CoRR (2021)
Keyphrases
</>
model based reinforcement learning
markov decision processes
supervised learning
policy iteration
training set
dynamic programming
optimal policy
markov decision process
bayesian networks
relational databases