A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback.

Published in: CoRR (2024)

Keyphrases