Optimizing time warp simulation with reinforcement learning techniques.
Jun WangCarl TropperPublished in: WSC (2007)
Keyphrases
- reinforcement learning
- simulation model
- learning algorithm
- real world
- simulation study
- function approximation
- reinforcement learning algorithms
- website
- objective function
- simulation environment
- optimal control
- markov decision processes
- exploration exploitation tradeoff
- databases
- temporal difference
- dynamical systems
- dynamic programming
- search space
- multi agent
- machine learning