An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions.

Yao Ma, Tingting Zhao, Kohei Hatano, Masashi Sugiyama
Published in: ECML/PKDD (2) (2014)
Keyphrases