Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid Reinforcement Learning.

Gen Li, Wenhao Zhan, Jason D. Lee, Yuejie Chi, Yuxin Chen
Published in: CoRR (2023)