A Theoretical Analysis of Nash Learning from Human Feedback under General KL-Regularized Preference.

Chenlu Ye Wei Xiong Yuheng Zhang Nan Jiang Tong Zhang

Published in: CoRR (2024)

Keyphrases

theoretical analysis
learning algorithm
learning systems
learning process
special case
decision trees
numerical simulations
language acquisition
least squares
supervised learning
assessment tool
human learning
learning tasks
decision theoretic
creative problem solving
motor skills
inductive inference
human subjects
machine learning
knowledge acquisition
online learning
active learning