Sign in

Counterexamples for Expected Rewards.

Tim QuatmannNils JansenChristian DehnertRalf WimmerErika ÁbrahámJoost-Pieter KatoenBernd Becker
Published in: FM (2015)
Keyphrases
  • reinforcement learning
  • markov decision processes
  • multiarmed bandit
  • data sets
  • neural network
  • computer vision
  • image processing
  • information technology