Variations on the Reinforcement Learning performance of Blackjack.
Avish BuramdoyalTim GebbiePublished in: CoRR (2023)
Keyphrases
- reinforcement learning
- function approximation
- state space
- learning algorithm
- machine learning
- policy search
- multi agent
- temporal difference
- optimal policy
- view angle
- direct policy search
- real time
- multi agent reinforcement learning
- autonomous learning
- continuous state
- stochastic approximation
- reinforcement learning methods
- function approximators
- action selection
- markov decision processes
- transfer learning
- face recognition