• search
    search
  • reviewers
    reviewers
  • feeds
    feeds
  • assignments
    assignments
  • settings
  • logout

Interactive Multi-objective Reinforcement Learning in Multi-armed Bandits with Gaussian Process Utility Models.

Diederik M. RoijersLuisa M. ZintgrafPieter LibinMathieu ReymondEugenio BargiacchiAnn Nowé
Published in: ECML/PKDD (3) (2020)
Keyphrases