Guiding the search in continuous state-action spaces by learning an action sampling distribution from off-target samples.

Published in: CoRR (2017)

Keyphrases