Login / Signup
Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem.
Maciej Wolczyk
Bartlomiej Cupial
Mateusz Ostaszewski
Michal Bortkiewicz
Michal Zajac
Razvan Pascanu
Lukasz Kucinski
Piotr Milos
Published in:
CoRR (2024)
Keyphrases
</>
fine tuning
reinforcement learning
statistical models
experimental data
complex systems
real time
neural network
probability distribution
d objects
least squares
computational models
accurate models