Login / Signup

A Priority Experience Replay Sampling Method Based on Upper Confidence Bound.

Fengkai KeDaxing ZhaoGuodong SunWei Feng
Published in: ICDLT (2019)
Keyphrases
  • upper confidence bound
  • contextual bandit
  • preemptive scheduling
  • data mining
  • priority queue