Login / Signup
A temporal difference method for multi-objective reinforcement learning.
Manuela Ruiz-Montiel
Lawrence Mandow
José-Luis Pérez-de-la-Cruz
Published in:
Neurocomputing (2017)
Keyphrases
</>
reinforcement learning
temporal difference
multi objective
model free
cost function
neural network
optimization algorithm
machine learning
objective function
pairwise
learning process
function approximation
function approximators
policy evaluation