Login / Signup
Optimal Treatment Allocation for Efficient Policy Evaluation in Sequential Decision Making.
Ting Li
Chengchun Shi
Jianing Wang
Fan Zhou
Hongtu Zhu
Published in:
NeurIPS (2023)
Keyphrases
</>
sequential decision making
temporal difference
reinforcement learning
policy evaluation
least squares
optimal solution
dynamic programming
decision problems
probabilistic model
worst case