Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF.

Banghua Zhu, Michael I. Jordan, Jiantao Jiao
Published in: CoRR (2024)