Login / Signup

Online Prompt Pricing based on Combinatorial Multi-Armed Bandit and Hierarchical Stackelberg Game.

Meiling LiHongrun RenHaixu XiongZhenxing QianXinpeng Zhang
Published in: CoRR (2024)
Keyphrases
  • stackelberg game
  • multi armed bandit
  • multi armed bandits
  • supply chain
  • online learning
  • pricing model
  • nash equilibrium
  • reinforcement learning
  • learning algorithm
  • multi class