Login / Signup
SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning.
Jongjin Park
Younggyo Seo
Jinwoo Shin
Honglak Lee
Pieter Abbeel
Kimin Lee
Published in:
ICLR (2022)
Keyphrases
</>
reinforcement learning
semi supervised
data sets
labeled data
learning process
prior knowledge
unlabeled data
supervised learning
database
learning problems
data collection
learning algorithm
co training
data analysis
semi supervised learning
eligibility traces
background knowledge
partially observable environments
state space
unsupervised learning
multi view
function approximation
reinforcement learning methods
data sources
training data