STARC: A General Framework For Quantifying Differences Between Reward Functions.

Published in: ICLR (2024)

Keyphrases