DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm.
Yunhao Tang
Tadashi Kozuno
Mark Rowland
Anna Harutyunyan
Rémi Munos
Bernardo Ávila Pires
Michal Valko
Published in: ICML (2023)
Keyphrases
multi-step
learning algorithm
dynamic programming
machine learning
objective function
search space
particle swarm optimization
optimization algorithm
optimal solution
feature vectors
NP-hard
nearest neighbor