Learning Best Response Policies in Dynamic Auctions via Deep Reinforcement Learning.

Vinzenz Thoma, Michael Curry, Niao He, Sven Seuken

Published in: CoRR (2023)

Keyphrases

reinforcement learning
learning process
learning algorithm
learning systems
learning problems
supervised learning
learning tasks
function approximation
online learning
learning capabilities
learning agents
policy gradient methods
state space
macro actions
hierarchical reinforcement learning
policy search
multi agent reinforcement learning
reinforcement learning methods
action selection
neural network
multi agent
dynamic environments