Forward and Backward State Abstractions for Off-policy Evaluation.

Meiling Hao, Pingfan Su, Liyuan Hu, Zoltan Szabo, Qingyuan Zhao, Chengchun Shi

Published in: CoRR (2024)

Keyphrases

forward and backward
policy evaluation
state space
reinforcement learning
least squares
supervised learning
model free
neural network
optical flow
function approximation
temporal difference
greedy search