C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments.
Vincent Liu
Yash Chandak
Philip S. Thomas
Martha White
Published in:
CoRR (2023)
Keyphrases
</>
non stationary
probability distribution
least squares
random fields