Login / Signup
Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments.
Vincent Liu
Yash Chandak
Philip S. Thomas
Martha White
Published in:
CoRR (2023)
Keyphrases
</>
non stationary
probability distribution
least squares
random fields