Learning Best Response Policies in Dynamic Auctions via Deep Reinforcement Learning.
Vinzenz ThomaMichael CurryNiao HeSven SeukenPublished in: CoRR (2023)
Keyphrases
- reinforcement learning
- learning process
- learning algorithm
- learning systems
- learning problems
- supervised learning
- learning tasks
- function approximation
- online learning
- learning capabilities
- learning agents
- policy gradient methods
- state space
- macro actions
- hierarchical reinforcement learning
- policy search
- multi agent reinforcement learning
- reinforcement learning methods
- action selection
- neural network
- multi agent
- dynamic environments