Recurrent policy gradients.

Daan Wierstra, Alexander Förster, Jan Peters, Jürgen Schmidhuber
Published in: Log. J. IGPL (2010)
Keyphrases
  • optimal policy
  • real world
  • asymptotically optimal
  • real time
  • information retrieval
  • artificial intelligence
  • information systems
  • knowledge base
  • steady state
  • feed forward
  • policy making