Solving the scalarization issues of Advantage-based Reinforcement Learning Algorithms.

Federico A. Galatolo, Mario G. C. A. Cimino, Gigliola Vaglini

Published in: CoRR (2020)

Keyphrases

reinforcement learning algorithms
reinforcement learning
state space
Markov decision processes
model free
reinforcement learning problems
eligibility traces
temporal difference
learning algorithm
reinforcement learning methods
multi objective
function approximation
partially observable environments
machine learning
solving problems
reward function
dynamic systems
dynamic programming
policy search
cost function
data mining