Login / Signup
Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach.
Miao Lu
Wenhao Yang
Liangyu Zhang
Zhihua Zhang
Published in:
CoRR (2022)
Keyphrases
</>
statistical estimation
markov decision processes
image segmentation
state space
reinforcement learning
markov chain
linear constraints
dynamic programming
continuous variables
linear systems
policy iteration
planning under uncertainty
policy search