C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Learning Preference Model for LLMs via Automatic Preference Data Generation.
Shijia Huang
Jianqiao Zhao
Yanyang Li
Liwei Wang
Published in:
EMNLP (2023)
Keyphrases
</>
data generation
preference learning
preference model
learning algorithm
learning process
prior knowledge
active learning
reinforcement learning
support vector
supervised learning
unsupervised learning
cross validation
learning tasks
learning problems