Login / Signup
Model-Based Policy Gradients with Parameter-Based Exploration by Least-Squares Conditional Density Estimation.
Syogo Mori
Voot Tangkaratt
Tingting Zhao
Jun Morimoto
Masashi Sugiyama
Published in:
CoRR (2013)
Keyphrases
</>
least squares
density ratio estimation
density ratio
conditional density estimation
policy evaluation
linear model
action selection
policy iteration
linear regression
optical flow
ls svm
optimal policy
machine learning
model free
outlier detection
semi parametric
dynamic programming
feature vectors
pairwise