Off-Policy Policy Gradient with State Distribution Correction.
Yao Liu
Adith Swaminathan
Alekh Agarwal
Emma Brunskill
Published in:
CoRR (2019)
Keyphrases
policy gradient
state space
probability distribution
optimal policy
approximation methods