Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning.
Mitsuhiko Nakamoto
Simon Zhai
Anikait Singh
Max Sobol Mark
Yi Ma
Chelsea Finn
Aviral Kumar
Sergey Levine
Published in:
NeurIPS (2023)
Keyphrases
fine tuning
online learning
fine tuned
real time
viable alternative
online training
supervised learning
reinforcement learning
batch mode
support vector machine
query language
test set
training phase
training set
multi agent
fine tune
databases