Sustainable Online Reinforcement Learning for Auto-bidding.
Zhiyu MouYusen HuoRongquan BaiMingzhou XieChuan YuJian XuBo ZhengPublished in: NeurIPS (2022)
Keyphrases
- reinforcement learning
- online learning
- state space
- management system
- function approximation
- reinforcement learning algorithms
- multi agent reinforcement learning
- learning algorithm
- decision making
- cooperative
- optimal policy
- dynamical systems
- markov decision processes
- reinforcement learning methods
- online environment