Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory.
Ruiqi Zhang
Xuezhou Zhang
Chengzhuo Ni
Mengdi Wang
Published in:
CoRR (2022)
Keyphrases
function approximation
function approximators
neural network
training data
Bayesian networks
reinforcement learning
support vector
supervised learning
computational intelligence
parameter estimation
random variables
conditional random fields
temporal difference methods