From Importance Sampling to Doubly Robust Policy Gradient.
Jiawei Huang
Nan Jiang
Published in: CoRR (2019)
Keyphrases
importance sampling
monte carlo
policy gradient
variance reduction
kalman filter
markov chain
particle filter
particle filtering
pairwise
higher order
maximum likelihood
approximate inference
gradient method