Exploiting Correlated Auxiliary Feedback in Parameterized Bandits.

Arun Verma, Zhongxiang Dai, Yao Shu, Bryan Kian Hsiang Low
Published in: CoRR (2023)
Keyphrases
  • machine learning
  • stochastic systems
  • feedback mechanisms
  • highly correlated
  • feedback loop
  • decision making
  • expert systems
  • relevance feedback
  • multi-armed bandits
  • real time
  • reinforcement learning
  • Markov chain