Sign in

Optimal distributions of rewards for a two-armed slot machine.

Zengjing ChenXinwei FengShuhui LiuXiaodong Yan
Published in: Neurocomputing (2023)
Keyphrases
  • dynamic programming
  • reinforcement learning
  • optimal solution
  • worst case
  • markov decision processes
  • optimal design
  • flowshop
  • real time
  • objective function
  • random variables
  • closed form
  • power law
  • batch processing