Sign in

Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback.

Yifu YuanJianye HaoYi MaZibin DongHebin LiangJinyi LiuZhixin FengKai ZhaoYan Zheng
Published in: CoRR (2024)
Keyphrases