A Reinforcement Learning Approach to Optimize Discount and Reputation Tradeoffs in E-commerce Systems.
Hong XieYongkun LiJohn C. S. LuiPublished in: ACM Trans. Internet Techn. (2020)
Keyphrases
- reinforcement learning
- trust model
- function approximation
- state space
- electronic commerce
- optimal policy
- temporal difference
- machine learning
- robotic control
- multi agent
- learning algorithm
- learning process
- transfer learning
- markov decision processes
- model free
- reputation systems
- multi agent reinforcement learning
- trust evaluation
- reputation models
- multi agent systems
- temporal difference learning
- stochastic approximation
- cost benefit
- real time