Utility-Based Reinforcement Learning: Unifying Single-objective and Multi-objective Reinforcement Learning.

Peter Vamplew Cameron Foale Conor F. Hayes Patrick Mannion Enda Howley Richard Dazeley Scott Johnson Johan Källström Gabriel de Oliveira Ramos Roxana Radulescu Willem Röpke Diederik M. Roijers

Published in: CoRR (2024)