Login / Signup
Mechanism Design for LLM Fine-tuning with Multiple Reward Models.
Haoran Sun
Yurong Chen
Siwei Wang
Wei Chen
Xiaotie Deng
Published in:
CoRR (2024)
Keyphrases
</>
fine tuning
mechanism design
multiagent planning
reinforcement learning
fine tune
evolutionary algorithm
special case
np hard