Login / Signup

Proxy-RLHF: Decoupling Generation and Alignment in Large Language Model with Proxy.

Yu ZhuChuxiong SunWenfei YangWenqiang WeiBo TangTianzhu ZhangZhiyu LiShifeng ZhangFeiyu XiongJie HuMingchuan Yang
Published in: CoRR (2024)
Keyphrases