A Q-based policy gradient optimization approach for Doudizhu.

Xiaomin Yu, Yisong Wang, Jin Qin, Panfeng Chen
Published in: Appl. Intell. (2023)
Keyphrases
  • parametric optimization
  • policy gradient
  • optimization algorithm
  • reinforcement learning
  • function approximation
  • optimization problems
  • actor-critic
  • model-free reinforcement learning