Login / Signup
Hallucinated adversarial control for conservative offline policy evaluation.
Jonas Rothfuss
Bhavya Sukhija
Tobias Birchler
Parnian Kassraie
Andreas Krause
Published in:
UAI (2023)
Keyphrases
</>
policy evaluation
control system
matrix inversion
neural network
reinforcement learning
temporal difference
variance reduction
computer vision
moving objects
dynamic programming
control strategy
constrained optimization
model free