A note on reinforcement learning with Wasserstein distance regularisation, with applications to multipolicy learning.

Published in: CoRR (2018)

Keyphrases