Natural Gradient Deep Q-learning.

Ethan Knight Osher Lerner

Published in: CoRR (2018)

Keyphrases

natural gradient
learning rate
independent component analysis
learning algorithm
blind source separation
convergence rate
function approximation
policy gradient
cooperative
reinforcement learning
convergence speed
fixed point
state space
multi agent
optimal policy
neural network
model free
markov decision processes
pattern recognition
mixing matrix