Login / Signup

Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint.

Zhipeng ChenKun ZhouWayne Xin ZhaoJunchen WanFuzheng ZhangDi ZhangJi-Rong Wen
Published in: CoRR (2024)
Keyphrases