Benchmarking reinforcement learning algorithms for demand response applications.
Brida V. MbuwirCarlo MannaFred SpiessensGeert DeconinckPublished in: ISGT-Europe (2020)
Keyphrases
- reinforcement learning algorithms
- reinforcement learning
- model free
- state space
- markov decision processes
- reinforcement learning problems
- eligibility traces
- temporal difference
- learning algorithm
- function approximation
- partially observable environments
- dynamic environments
- reinforcement learning methods
- reward function
- policy search
- tabula rasa
- mobile robot
- reward shaping
- multiagent reinforcement learning
- partially observable
- dynamic programming
- em algorithm
- particle filter
- markov chain
- least squares