C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Sample-Efficient Model-Free Reinforcement Learning with Off-Policy Critics.
Denis Steckelmacher
Hélène Plisnier
Diederik M. Roijers
Ann Nowé
Published in:
CoRR (2019)
Keyphrases
</>
learning algorithm
multi agent
upper bound