Login / Signup

PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs.

Rongzhi ZhangJiaming ShenTianqi LiuHaorui WangZhen QinFeng HanJialu LiuSimon BaumgartnerMichael BenderskyChao Zhang
Published in: CoRR (2024)
Keyphrases