Learning Preference Model for LLMs via Automatic Preference Data Generation.
Shijia Huang
Jianqiao Zhao
Yanyang Li
Liwei Wang
Published in:
EMNLP (2023)
Keyphrases
data generation
preference learning
preference model
learning algorithm
learning process
prior knowledge
active learning
reinforcement learning
support vector
supervised learning
unsupervised learning
cross validation
learning tasks
learning problems