Login / Signup
Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic Memory.
Li Siyao
Weijiang Yu
Tianpei Gu
Chunze Lin
Quan Wang
Chen Qian
Chen Change Loy
Ziwei Liu
Published in:
CVPR (2022)
Keyphrases
</>
actor critic
reinforcement learning
temporal difference
policy gradient
neuro fuzzy
gradient method
approximate dynamic programming
function approximation
optimal control
reinforcement learning algorithms
average reward
dynamic programming
optimal policy
markov decision processes
convergence rate
policy iteration