Exploration in Deep Reinforcement Learning: A Survey.

Pawel Ladosz Lilian Weng Minwoo Kim Hyondong Oh

Published in: CoRR (2022)

Keyphrases

reinforcement learning
active exploration
exploration strategy
exploration exploitation
model based reinforcement learning
action selection
function approximation
markov decision processes
learning algorithm
autonomous learning
temporal difference
machine learning
state space
exploration exploitation tradeoff
multi agent reinforcement learning
model free
multi agent
robotic control
neural network
temporal difference learning
optimal control
optimal policy
dynamic programming
real time
deep learning
database
policy search
website
information systems
data mining