Mechanism Design for LLM Fine-tuning with Multiple Reward Models.

Haoran Sun, Yurong Chen, Siwei Wang, Wei Chen, Xiaotie Deng
Published in: CoRR (2024)
Keyphrases
  • fine tuning
  • mechanism design
  • multiagent planning
  • reinforcement learning
  • fine tune
  • evolutionary algorithm
  • special case
  • np hard