Off-Policy Policy Gradient with State Distribution Correction.
Yao Liu
Adith Swaminathan
Alekh Agarwal
Emma Brunskill
Published in: CoRR (2019)
Keyphrases
policy gradient
state space
probability distribution
optimal policy
approximation methods