Goodhart's Law in Reinforcement Learning.

Jacek Karwowski, Oliver Hayman, Xingjian Bai, Klaus Kiendlhofer, Charlie Griffin, Joar Max Viktor Skalse

Published in: ICLR (2024)

Keyphrases

supervised learning
reinforcement learning
learning algorithm
machine learning
temporal difference
state space
function approximation
model free
active learning
reinforcement learning algorithms
transfer learning
control problems
temporal difference learning
learning process
continuous state
multi agent
learning agents
policy search
function approximators
multi agent reinforcement learning
legal reasoning
partially observable
markov decision processes
optimal policy
search algorithm
case study
artificial intelligence