Q-Learning as Failure.

Kei Takahata Takao Miura

Published in: EJC (2020)

Keyphrases

cooperative
multi agent
reinforcement learning
function approximation
state space
learning algorithm
stochastic approximation
optimal policy
root cause
model free
action selection
expert systems
learning process
dynamic programming
failure rate