Contextual Multi-armed Bandit Algorithm for Semiparametric Reward Model.

Gi-Soo Kim Myunghee Cho Paik

Published in: ICML (2019)

Keyphrases

probabilistic model
semi parametric
objective function
multi armed bandit
learning algorithm
detection algorithm
similarity measure
input data
bayesian framework
parameter estimation
linear model
least squares
closed form
k means
probability distribution
markov random field
optimal solution
model free
neural network