Linear combination of one-step predictive information with an external reward in an episodic policy gradient setting: a critical analysis.
Keyan Zahedi
Georg Martius
Nihat Ay
Published in:
CoRR (2013)
Keyphrases
linear combination
basis functions
small number
reinforcement learning
machine learning
feature selection
function approximation