Improvement of the LPWAN AMI backhaul's latency thanks to reinforcement learning algorithms.
Rémi Bonnefoi, Christophe Moy, Jacques Palicot
Published in: EURASIP J. Wirel. Commun. Netw. (2018)
Keyphrases
- reinforcement learning algorithms
- reinforcement learning
- state space
- model free
- Markov decision processes
- reinforcement learning problems
- ambient intelligence
- learning algorithm
- reinforcement learning methods
- temporal difference
- eligibility traces
- function approximation
- partially observable environments
- reward function
- stochastic games
- function approximators
- reward shaping
- dynamic environments
- policy search
- dynamic programming
- multiagent reinforcement learning
- tabula rasa
- Markov chain
- training data