A Periodic Traveling Politician Problem with Time-Dependent Rewards.

Deniz Aksen, Masoud Shahmanzari

Published in: OR (2016)

Keyphrases

reinforcement learning
multiarmed bandit
travel time
Markov decision processes
bandit problems
artificial intelligence
multi agent
quasi-periodic
multi armed bandits
database
data sets
social networks
search algorithm
free riding
long term and short term