Iterative Reward Shaping Using Human Feedback for Correcting Reward Misspecification.

Published in: ECAI (2023)

Keyphrases