Combinational Q-Learning for Dou Di Zhu.
Yang YouLiangwei LiBaisong GuoWeiming WangCewu LuPublished in: CoRR (2019)
Keyphrases
- reinforcement learning
- cooperative
- function approximation
- state space
- action selection
- model free
- multi agent
- reinforcement learning algorithms
- learning rate
- learning algorithm
- stochastic approximation
- optimal policy
- ieee trans
- multi agent reinforcement learning
- decision making
- temporal difference learning
- continuous state and action spaces
- real time
- learning problems
- decision trees
- single agent
- genetic algorithm
- data mining
- data sets
- asynchronous circuits
- potential field
- td learning
- bucket brigade
- database