Is RLHF More Difficult than Standard RL? A Theoretical Perspective.
Yuanhao Wang
Qinghua Liu
Chi Jin
Published in:
NeurIPS (2023)
Keyphrases
reinforcement learning
theoretical analysis
database
case study
multi agent
state space
genetic algorithm
decision trees
viewpoint
learning process
action selection