When the Best Move Isn't Optimal: Q-learning with Exploration.
George H. John
Published in:
AAAI (1994)
Keyphrases
action selection
multi agent
learning algorithm
reinforcement learning
cooperative
dynamic programming
exploration strategy
optimal solution
database
machine learning
search engine
closed form
optimal design