Login / Signup
The State-Action-Reward-State-Action Algorithm in Spatial Prisoner's Dilemma Game.
Lanyu Yang
Dongchun Jiang
Fuqiang Guo
Mingjian Fu
Published in:
CoRR (2024)
Keyphrases
</>
state action
average reward
reinforcement learning
policy gradient
computational complexity
belief state
dynamic programming
optimal solution
evaluation function
learning algorithm
search space
long run
probabilistic model
distance metric
path finding
stochastic games