Sign in

Rewards Encoding Environment Dynamics Improves Preference-based Reinforcement Learning.

Katherine MetcalfMiguel SarabiaBarry-John Theobald
Published in: CoRR (2022)
Keyphrases