An Actor-Critic Algorithm for Sequence Prediction.
Dzmitry Bahdanau
Philemon Brakel
Kelvin Xu
Anirudh Goyal
Ryan Lowe
Joelle Pineau
Aaron C. Courville
Yoshua Bengio
Published in:
ICLR (Poster) (2017)
Keyphrases
sequence prediction
dynamic programming
learning algorithm
reinforcement learning
search space
artificial intelligence
probabilistic model
monte carlo
optimal solution
linear programming
machine learning algorithms
step size
negative matrix factorization
gradient method
actor critic