Login / Signup
Causal Confusion and Reward Misidentification in Preference-Based Reward Learning.
Jeremy Tien
Jerry Zhi-Yang He
Zackory Erickson
Anca D. Dragan
Daniel S. Brown
Published in:
ICLR (2023)
Keyphrases
</>
reinforcement learning
neural network
learning algorithm
learning systems
learning scheme
data sets
data mining
mobile robot
knowledge acquisition
unsupervised learning
incremental learning
inductive inference
learning agent
inverse reinforcement learning
causal knowledge
eligibility traces