Login / Signup
Hallucinated Adversarial Control for Conservative Offline Policy Evaluation.
Jonas Rothfuss
Bhavya Sukhija
Tobias Birchler
Parnian Kassraie
Andreas Krause
Published in:
CoRR (2023)
Keyphrases
</>
policy evaluation
control system
least squares
temporal difference
matrix inversion
probabilistic model
machine learning
model free
evaluation function
variance reduction
semi parametric
policy iteration
control strategy
markov decision processes
monte carlo
search space
reinforcement learning