Models of human preference for learning reward functions.

Published in: Trans. Mach. Learn. Res. (2024)

Keyphrases