Utility-Based Reinforcement Learning: Unifying Single-objective and Multi-objective Reinforcement Learning.
Peter VamplewCameron FoaleConor F. HayesPatrick MannionEnda HowleyRichard DazeleyScott JohnsonJohan KällströmGabriel de Oliveira RamosRoxana RadulescuWillem RöpkeDiederik M. RoijersPublished in: CoRR (2024)
Keyphrases
- reinforcement learning
- multi objective
- model free
- function approximation
- state space
- reinforcement learning algorithms
- optimal policy
- multi agent
- markov decision processes
- trade off
- particle swarm optimization
- multi agent reinforcement learning
- temporal difference
- evolutionary algorithm
- least squares
- optimal control
- probability distribution
- genetic algorithm
- action space
- function approximators
- temporal difference learning
- neural network