Learning Reward Functions from Diverse Sources of Human Feedback: Optimally Integrating Demonstrations and Preferences.

Published in: CoRR (2020)

Keyphrases