Login / Signup

Active teacher selection for reinforcement learning from human feedback.

Rachel FreedmanJustin SvegliatoKyle Hollins WrayStuart Russell
Published in: CoRR (2023)
Keyphrases