Login / Signup

Can You Rely on Synthetic Labellers in Preference-Based Reinforcement Learning? It's Complicated.

Katherine MetcalfMiguel SarabiaMasha FedzechkinaBarry-John Theobald
Published in: AAAI (2024)
Keyphrases