Improving Generalization in Mountain Car Through the Partitioned Parameterized Policy Approach via Quasi-Stochastic Gradient Descent.
Caleb M. Bowyer
Published in:
CoRR (2021)
Keyphrases
stochastic gradient descent
least squares
matrix factorization
loss function
step size
random forests
function approximation
online algorithms
weight vector
collaborative filtering
optimal policy
cost function
evolutionary algorithm
reinforcement learning
regularization parameter
importance sampling
pairwise