Login / Signup

Distributed Off-Policy Temporal Difference Learning Using Primal-Dual Method.

Donghwan LeeDo Wan KimJianghai Hu
Published in: IEEE Access (2022)
Keyphrases
  • cost function
  • objective function
  • convergence rate
  • primal dual
  • support vector machine
  • convex optimization
  • temporal difference learning
  • computational complexity
  • pairwise
  • dynamic programming