Login / Signup
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size.
Alexander Nikulin
Vladislav Kurenkov
Denis Tarasov
Dmitry Akimov
Sergey Kolesnikov
Published in:
CoRR (2022)
Keyphrases
</>
learning algorithm
reinforcement learning
training data
learning process
dynamic programming
batch size