Login / Signup

Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement Learning.

Nathan KallusMasatoshi Uehara
Published in: Oper. Res. (2022)
Keyphrases