Login / Signup
Sample-Efficient Preference-based Reinforcement Learning with Dynamics Aware Rewards.
Katherine Metcalf
Miguel Sarabia
Natalie Mackraz
Barry-John Theobald
Published in:
CoRL (2023)
Keyphrases
</>
reinforcement learning
markov decision processes
model free
state space
supervised learning
cost effective
reinforcement learning algorithms
computationally efficient
function approximation
machine learning
learning algorithm
semi supervised
optimal policy
dynamic model