Reinforcement Learning of Optimal Supervisor Based on Language Measure.
Tatsushi YamasakiKazutaka TaniguchiToshimitsu UshioPublished in: CDC/ECC (2005)
Keyphrases
- reinforcement learning
- dynamic programming
- optimal control
- optimal solution
- natural language
- state space
- programming language
- similarity measure
- distance measure
- function approximation
- reinforcement learning algorithms
- reward function
- closed form
- optimal policy
- logic programming
- optimal design
- learning algorithm
- genetic algorithm