An Online Prediction Algorithm for Reinforcement Learning with Linear Function Approximation using Cross Entropy Method.
Ajin George Joseph
Shalabh Bhatnagar
Published in:
CoRR (2018)
Keyphrases
function approximation
cross entropy
reinforcement learning
prediction algorithm
function approximators
model free
cost function
log likelihood
pairwise
active learning
text classification
closed form
temporal difference
similarity measure
high dimensional
error function
training set