Hypervolume indicator and dominance reward based multi-objective Monte-Carlo Tree Search.
Weijia Wang, Michèle Sebag
Published in: Mach. Learn. (2013)
Keyphrases
- monte carlo tree search
- multi objective
- monte carlo
- hypervolume indicator
- evolutionary algorithm
- reinforcement learning
- evaluation function
- multi objective optimization
- bayesian reinforcement learning
- particle swarm optimization
- genetic algorithm
- objective function
- reference point
- temporal difference
- game tree
- reinforcement learning methods
- temporal difference learning
- monte carlo search
- linear programming
- neural network
- control problems