Adaptive trajectory-constrained exploration strategy for deep reinforcement learning.
Guojian Wang
Faguo Wu
Xiao Zhang
Ning Guo
Zhiming Zheng
Published in:
Knowl. Based Syst. (2024)
Keyphrases
exploration strategy
reinforcement learning
unknown environments
state space
function approximation
reinforcement learning algorithms
locally optimal
Markov decision processes
machine learning
learning algorithm
multi-agent
optimal policy
dynamic programming
multiple robots