Login / Signup
Forward and Backward State Abstractions for Off-policy Evaluation.
Meiling Hao
Pingfan Su
Liyuan Hu
Zoltan Szabo
Qingyuan Zhao
Chengchun Shi
Published in:
CoRR (2024)
Keyphrases
</>
forward and backward
policy evaluation
state space
reinforcement learning
least squares
supervised learning
model free
neural network
optical flow
function approximation
temporal difference
greedy search