Login / Signup

Supervised Fine-Tuning as Inverse Reinforcement Learning.

Hao Sun
Published in: CoRR (2024)
Keyphrases