A comparison of learning performance in two-dimensional Q-learning by the difference of Q-values alignment.
Kathy Thi AungTakayasu FuchidaPublished in: Artif. Life Robotics (2012)
Keyphrases
- learning algorithm
- reinforcement learning
- learning systems
- learning process
- solving problems
- multiagent learning
- learning problems
- optimal policy
- learning tasks
- eligibility traces
- action selection
- learning community
- knowledge acquisition
- online learning
- multi dimensional
- supervised learning
- state space
- bayesian networks