Self-Evolution Fine-Tuning for Policy Optimization.

Ruijun Chen, Jiehao Liang, Shiping Gao, Fanqi Wan, Xiaojun Quan
Published in: CoRR (2024)
Keyphrases
  • fine tuning
  • viable alternative
  • fine tune
  • linear programming
  • optimal policy
  • real time
  • learning algorithm
  • objective function
  • information technology
  • cost function
  • optimization problems
  • infinite horizon