TIDBD: Adapting Temporal-difference Step-sizes Through Stochastic Meta-descent.

Published in: CoRR (2018)

Keyphrases