A Periodic Traveling Politician Problem with Time-Dependent Rewards.
Deniz Aksen
Masoud Shahmanzari
Published in:
OR (2016)
Keyphrases
reinforcement learning
multi-armed bandit
travel time
Markov decision processes
bandit problems
artificial intelligence
multi agent
quasi periodic
multi armed bandits
database
data sets
social networks
search algorithm
free riding
long term and short term