Login / Signup
Contextual Multi-armed Bandit Algorithm for Semiparametric Reward Model.
Gi-Soo Kim
Myunghee Cho Paik
Published in:
ICML (2019)
Keyphrases
</>
probabilistic model
semi parametric
objective function
multi armed bandit
learning algorithm
detection algorithm
similarity measure
input data
bayesian framework
parameter estimation
linear model
least squares
closed form
k means
probability distribution
markov random field
optimal solution
model free
neural network