Hill Climbing on Value Estimates for Search-control in Dyna.
Yangchen PanHengshuai YaoAmir-massoud FarahmandMartha WhitePublished in: CoRR (2019)
Keyphrases
- hill climbing
- search space
- search algorithm
- search strategy
- simulated annealing
- search procedure
- direct search
- max min
- path finding
- branching factor
- systematic search
- hill climbing algorithm
- genetic algorithm ga
- hybrid algorithms
- tabu search
- beam search
- greedy search
- rule learning
- steepest ascent
- temporal difference learning
- exhaustive search
- optimal control
- search strategies
- fitness function
- bayesian network structure learning
- control system
- artificial neural networks