Login / Signup
Playing Flappy Bird via Asynchronous Advantage Actor Critic Algorithm.
Elit Cenk Alp
Mehmet Serdar Güzel
Published in:
CoRR (2019)
Keyphrases
</>
learning algorithm
optimal solution
dynamic programming
neural network
computational complexity
reinforcement learning
objective function
multi agent systems
search space
k means
cost function
linear programming
policy gradient