Login / Signup
Focused Context Balancing for Robust Offline Policy Evaluation.
Hao Zou
Kun Kuang
Boqi Chen
Peixuan Chen
Peng Cui
Published in:
KDD (2019)
Keyphrases
</>
policy evaluation
model free
temporal difference
neural network
artificial neural networks
least squares
reinforcement learning
function approximation
matrix inversion