Balanced Off-Policy Evaluation in General Action Spaces.
Arjun Sondhi
David Arbour
Drew Dimmery
Published in:
AISTATS (2020)
Keyphrases
action space
Markov decision processes
policy evaluation
learning algorithm
reinforcement learning
state space
least squares
Monte Carlo
function approximation
Bayesian networks
dynamical systems
learning tasks
model-free
semi-parametric