ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles.

Kai Zhao Jianye Hao Yi Ma Jinyi Liu Yan Zheng Zhaopeng Meng

Published in: AAMAS (2024)

Keyphrases

reinforcement learning
real time
learning algorithm
function approximation
decision trees
state space
online learning
temporal difference
reinforcement learning algorithms
balancing exploration and exploitation
multi class
markov decision processes
ensemble methods
ensemble learning
learning classifier systems
temporal difference learning