Login / Signup
Policy Evaluation and Optimization with Continuous Treatments.
Nathan Kallus
Angela Zhou
Published in:
CoRR (2018)
Keyphrases
</>
policy evaluation
least squares
temporal difference
reinforcement learning
optimization algorithm
monte carlo
model free
semi parametric
multi agent
computational complexity
linear regression
function approximation
policy iteration