A Reinforcement Learning Scheme for Active Multi-Debris Removal Mission Planning With Modified Upper Confidence Bound Tree Search.
Jianan YangXiaolei HouYu Hen HuYong LiuQuan PanPublished in: IEEE Access (2020)
Keyphrases
- learning scheme
- tree search
- mission planning
- upper confidence bound
- learning algorithm
- contextual bandit
- branch and bound
- search algorithm
- constraint propagation
- search tree
- mathematical programming
- reinforcement learning
- decision making
- state space
- search space
- stochastic local search
- markov chain
- rough sets
- path finding