Login / Signup
Making RL with Preference-based Feedback Efficient via Randomization.
Runzhe Wu
Wen Sun
Published in:
CoRR (2023)
Keyphrases
</>
cost effective
reinforcement learning
case study
lightweight
privacy preserving
computationally expensive
databases
machine learning
image restoration
optimal policy
user feedback
optimal control