Reinforcement Learning for Self-exploration in Narrow Spaces.
Zhaofeng TianZichuan LiuXingyu ZhouWeisong ShiPublished in: CoRR (2022)
Keyphrases
- reinforcement learning
- active exploration
- exploration strategy
- action selection
- model based reinforcement learning
- exploration exploitation
- function approximation
- autonomous learning
- state space
- markov decision processes
- reinforcement learning algorithms
- optimal policy
- learning algorithm
- exploration exploitation tradeoff
- machine learning
- temporal difference
- dynamic programming
- robotic control
- function approximators
- temporal difference learning
- search strategies
- learning process
- search engine
- balancing exploration and exploitation