Login / Signup

Batch Active Learning of Reward Functions from Human Preferences.

Erdem BiyikNima AnariDorsa Sadigh
Published in: CoRR (2024)
Keyphrases