A Theoretical Analysis of Nash Learning from Human Feedback under General KL-Regularized Preference.
Chenlu YeWei XiongYuheng ZhangNan JiangTong ZhangPublished in: CoRR (2024)
Keyphrases
- theoretical analysis
- learning algorithm
- learning systems
- learning process
- special case
- decision trees
- numerical simulations
- language acquisition
- least squares
- supervised learning
- assessment tool
- human learning
- learning tasks
- decision theoretic
- creative problem solving
- motor skills
- inductive inference
- human subjects
- machine learning
- knowledge acquisition
- online learning
- active learning