Login / Signup
Online Prompt Pricing based on Combinatorial Multi-Armed Bandit and Hierarchical Stackelberg Game.
Meiling Li
Hongrun Ren
Haixu Xiong
Zhenxing Qian
Xinpeng Zhang
Published in:
CoRR (2024)
Keyphrases
</>
stackelberg game
multi armed bandit
multi armed bandits
supply chain
online learning
pricing model
nash equilibrium
reinforcement learning
learning algorithm
multi class