TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching.
Yecheng Jason Ma
Kausik Sivakumar
Jason Yan
Osbert Bastani
Dinesh Jayaraman
Published in: CoRR (2023)
Keyphrases
model-based reinforcement learning
prior knowledge
learning algorithm
dynamic programming
probabilistic model
dynamic Bayesian networks
Bayesian networks
supervised learning
optimal policy
hidden variables
structured prediction