Beyond Human Preferences: Exploring Reinforcement Learning Trajectory Evaluation and Improvement through LLMs.

Zichao Shen, Tianchen Zhu, Qingyun Sun, Shiqi Gao, Jianxin Li
Published in: CoRR (2024)