Login / Signup
Sample Efficient Reinforcement Learning from Human Feedback via Active Exploration.
Viraj Mehta
Vikramjeet Das
Ojash Neopane
Yijia Dai
Ilija Bogunovic
Jeff G. Schneider
Willie Neiswanger
Published in:
CoRR (2023)
Keyphrases
</>
active exploration
reinforcement learning
small sample
active learning
data sets
sequential decision problems
case study
least squares
learning process
support vector machine
sensor networks
supervised learning
model selection