Reinforcement Learning for Self-exploration in Narrow Spaces.

Zhaofeng Tian Zichuan Liu Xingyu Zhou Weisong Shi

Published in: CoRR (2022)

Keyphrases

reinforcement learning
active exploration
exploration strategy
action selection
model based reinforcement learning
exploration exploitation
function approximation
autonomous learning
state space
markov decision processes
reinforcement learning algorithms
optimal policy
learning algorithm
exploration exploitation tradeoff
machine learning
temporal difference
dynamic programming
robotic control
function approximators
temporal difference learning
search strategies
learning process
search engine
balancing exploration and exploitation