Is RLHF More Difficult than Standard RL?

Yuanhao Wang, Qinghua Liu, Chi Jin
Published in: CoRR (2023)
Keyphrases
  • reinforcement learning
  • real time
  • cooperative
  • markov decision processes
  • computer vision
  • knowledge base
  • case study
  • optimal solution
  • expert systems