Optimal Demand Response Using Device Based Reinforcement Learning.
Zheng WenDaniel O'NeillHamid Reza MaeiPublished in: CoRR (2014)
Keyphrases
- reinforcement learning
- optimal control
- dynamic programming
- state space
- reinforcement learning algorithms
- genetic algorithm
- worst case
- machine learning
- optimal solution
- profit maximizing
- budget constraints
- inventory systems
- holding cost
- average reward
- lead time
- function approximation
- markov decision processes
- optimal policy
- markov chain
- supply chain
- multi agent