• search
    search
  • reviewers
    reviewers
  • feeds
    feeds
  • assignments
    assignments
  • settings
  • logout

Scalar reward is not enough: a response to Silver, Singh, Precup and Sutton (2021).

Peter VamplewBenjamin J. SmithJohan KällströmGabriel de Oliveira RamosRoxana RadulescuDiederik M. RoijersConor F. HayesFredrik HeintzPatrick MannionPieter J. K. LibinRichard DazeleyCameron Foale
Published in: Auton. Agents Multi Agent Syst. (2022)
Keyphrases
  • graphical models
  • reinforcement learning
  • temporal difference
  • vector valued
  • real time
  • database
  • databases
  • data mining
  • data structure
  • decision making
  • image processing
  • medical images
  • bandit problems