Login / Signup
A Priority Experience Replay Sampling Method Based on Upper Confidence Bound.
Fengkai Ke
Daxing Zhao
Guodong Sun
Wei Feng
Published in:
ICDLT (2019)
Keyphrases
</>
upper confidence bound
contextual bandit
preemptive scheduling
data mining
priority queue