Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement.
Samuel Neumann
Sungsu Lim
Ajin George Joseph
Yangchen Pan
Adam White
Martha White
Published in:
ICLR (2023)
Keyphrases
cross entropy
dynamic programming
neural network
cost function
mathematical model
reinforcement learning
error function
objective function
active learning
probabilistic model
support vector machine
linear regression
model free