Login / Signup
MULEX: Disentangling Exploitation from Exploration in Deep RL.
Lucas Beyer
Damien Vincent
Olivier Teboul
Sylvain Gelly
Matthieu Geist
Olivier Pietquin
Published in:
CoRR (2019)
Keyphrases
</>
exploration exploitation tradeoff
reinforcement learning
function approximation
objective function
relevance feedback
action selection
exploration strategy
autonomous learning
exploration exploitation
deep learning
database
e learning
multi agent
learning process
active learning
decision trees