Model-Free HVAC Optimizer based on Reinforcement Learning.
Charalampos MarantosChristos P. LamprakosKostas SioziosDimitrios SoudrisPublished in: ISIE (2023)
Keyphrases
- model free
- reinforcement learning
- reinforcement learning algorithms
- function approximation
- temporal difference
- policy iteration
- state space
- policy evaluation
- reinforcement learning methods
- rl algorithms
- partially observable
- genetic algorithm
- temporal difference learning
- average reward
- markov decision processes
- learning algorithm
- markov chain
- action space
- learning process
- pattern recognition
- multi agent
- machine learning
- impedance control