Models of human preference for learning reward functions.

W. Bradley Knox, Stephane Hatgis-Kessell, Serena Booth, Scott Niekum, Peter Stone, Alessandro Allievi

Published in: CoRR (2022)

Keyphrases