Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning.
Rujie Zhong
Duohan Zhang
Lukas Schäfer
Stefano V. Albrecht
Josiah Hanna
Published in: NeurIPS (2022)
Keyphrases
reinforcement learning
policy evaluation
optimal policy
function approximation
training data
least squares
Monte Carlo
statistical methods
machine learning
optical flow
high dimensional data
infinite horizon
model free
action selection