Self-Evolution Fine-Tuning for Policy Optimization.
Ruijun Chen
Jiehao Liang
Shiping Gao
Fanqi Wan
Xiaojun Quan
Published in:
CoRR (2024)
Keyphrases
fine tuning
viable alternative
fine tune
linear programming
optimal policy
real time
learning algorithm
objective function
information technology
cost function
optimization problems
infinite horizon