Login / Signup
Combining Deep Deterministic Policy Gradient with Cross-Entropy Method.
Tung-Yi Lai
Chu-Hsuan Hsueh
You-Hsuan Lin
Yeong-Jia Roger Chu
Bo Yang Hsueh
I-Chen Wu
Published in:
TAAI (2019)
Keyphrases
</>
cross entropy
gradient ascent
maximum likelihood
energy function
active learning
information retrieval
objective function
probabilistic model
error function