Login / Signup
Adaptive trajectory-constrained exploration strategy for deep reinforcement learning.
Guojian Wang
Faguo Wu
Xiao Zhang
Ning Guo
Zhiming Zheng
Published in:
CoRR (2023)
Keyphrases
</>
exploration strategy
reinforcement learning
unknown environments
locally optimal
function approximation
learning capabilities
model free
markov decision processes
machine learning
state space
single agent
multi agent
learning algorithm
optimal policy
action selection
mobile robot
optimal solution
multiple robots