2nd Workshop on Multi-Armed Bandits and Reinforcement Learning: Advancing Decision Making in E-Commerce and Beyond.
Chu WangYingfei WangHaipeng LuoDaniel JiangJinghai HeZeyu ZhengPublished in: KDD (2023)
Keyphrases
- multi armed bandits
- reinforcement learning
- decision making
- multi armed bandit
- action selection
- bandit problems
- decision makers
- electronic commerce
- function approximation
- state space
- model free
- reinforcement learning algorithms
- temporal difference
- markov decision processes
- influence diagrams
- optimal policy
- supervised learning
- artificial intelligence
- machine learning
- optimal control
- data mining
- dynamic programming
- markov decision problems
- learning algorithm