Login / Signup

Making RL with Preference-based Feedback Efficient via Randomization.

Runzhe WuWen Sun
Published in: CoRR (2023)
Keyphrases
  • cost effective
  • reinforcement learning
  • case study
  • lightweight
  • privacy preserving
  • computationally expensive
  • databases
  • machine learning
  • image restoration
  • optimal policy
  • user feedback
  • optimal control