DRLC: Reinforcement Learning with Dense Rewards from LLM Critic.

Meng Cao, Lei Shu, Lei Yu, Yun Zhu, Nevan Wichers, Yinxiao Liu, Lei Meng
Published in: CoRR (2024)
Keyphrases