Solving the scalarization issues of Advantage-based Reinforcement Learning Algorithms.
Federico A. GalatoloMario G. C. A. CiminoGigliola VagliniPublished in: CoRR (2020)
Keyphrases
- reinforcement learning algorithms
- reinforcement learning
- state space
- markov decision processes
- model free
- reinforcement learning problems
- eligibility traces
- temporal difference
- learning algorithm
- reinforcement learning methods
- multi objective
- function approximation
- partially observable environments
- machine learning
- solving problems
- reward function
- dynamic systems
- dynamic programming
- policy search
- cost function
- data mining