Login / Signup
Combinatorial Pure Exploration with Continuous and Separable Reward Functions and Its Applications.
Weiran Huang
Jungseul Ok
Liang Li
Wei Chen
Published in:
IJCAI (2018)
Keyphrases
</>
reward function
markov decision processes
multiple agents
reinforcement learning
inverse reinforcement learning
state space
optimal policy
state variables
machine learning
web pages
hidden markov models
conditional random fields
transition probabilities
reinforcement learning algorithms
policy search