An online prediction algorithm for reinforcement learning with linear function approximation using cross entropy method.
Ajin George Joseph
Shalabh Bhatnagar
Published in:
Mach. Learn. (2018)
Keyphrases
function approximation
cross entropy
reinforcement learning
function approximators
prediction algorithm
model free
machine learning
log likelihood
control policy
neural network
least squares
classification accuracy
cost function
closed form
em algorithm
dynamic programming
pairwise