Login / Signup
Contextual Bandits with Continuous Actions: Smoothing, Zooming, and Adapting.
Akshay Krishnamurthy
John Langford
Aleksandrs Slivkins
Chicheng Zhang
Published in:
J. Mach. Learn. Res. (2020)
Keyphrases
</>
continuous action
action space
contextual information
situation calculus
goal directed
reasoning about actions
nonparametric regression
real time
neural network
dynamic programming
state space
context sensitive
context dependent
action selection
image filtering
stochastic systems