Focused Context Balancing for Robust Offline Policy Evaluation.

Hao Zou, Kun Kuang, Boqi Chen, Peixuan Chen, Peng Cui
Published in: KDD (2019)
Keyphrases
  • policy evaluation
  • model free
  • temporal difference
  • neural network
  • artificial neural networks
  • least squares
  • reinforcement learning
  • function approximation
  • matrix inversion