Can You Rely on Synthetic Labellers in Preference-Based Reinforcement Learning? It's Complicated.

Published in: AAAI (2024)

Keyphrases