Login / Signup
Combinatorial Pure Exploration with Continuous and Separable Reward Functions and Its Applications (Extended Version).
Weiran Huang
Jungseul Ok
Liang Li
Wei Chen
Published in:
CoRR (2018)
Keyphrases
</>
reward function
reinforcement learning
state space
markov decision processes
machine learning
multiple agents
markov decision process
learning algorithm
web pages
probability distribution
transition probabilities
inverse reinforcement learning