Is RLHF More Difficult than Standard RL? A Theoretical Perspective.

Yuanhao Wang Qinghua Liu Chi Jin

Published in: NeurIPS (2023)

Keyphrases

reinforcement learning
theoretical analysis
database
case study
multi agent
state space
genetic algorithm
decision trees
viewpoint
learning process
action selection