Login / Signup
Offline-to-Online Reinforcement Learning via Balanced Replay and Pessimistic Q-Ensemble.
Seunghyun Lee
Younggyo Seo
Kimin Lee
Pieter Abbeel
Jinwoo Shin
Published in:
CoRR (2021)
Keyphrases
</>
reinforcement learning
real time
neural network
learning algorithm
online learning
state space
ensemble learning
balancing exploration and exploitation
feature selection
semi supervised
decision makers
ensemble methods
base classifiers