Login / Signup

An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients.

João CarvalhoDavide TateoFabio MuratoreJan Peters
Published in: IJCNN (2021)
Keyphrases
  • information theory
  • similarity measure
  • partial derivatives
  • distance measure
  • neural network
  • optimal policy
  • policy makers