Miguel Suau
Publication Activity (10 Years)
Years Active: 2019-2023
Publications (10 Years): 12
Top Topics
Reinforcement Learning
Partially Observable
Belief State
Sequential Decision Tasks
Top Venues
CoRR
NeurIPS
AAMAS
ICML
Publications
Miguel Suau, Matthijs T. J. Spaan, Frans A. Oliehoek
Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL.
CoRR (2023)
Miguel Suau, Jinke He, Mustafa Mert Çelikok, Matthijs T. J. Spaan, Frans A. Oliehoek
Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems.
NeurIPS (2022)
Miguel Suau, Jinke He, Matthijs T. J. Spaan, Frans A. Oliehoek
Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep RL in Large Networked Systems.
CoRR (2022)
Jinke He, Miguel Suau, Hendrik Baier, Michael Kaisers, Frans A. Oliehoek
Online Planning in POMDPs with Self-Improving Simulators.
IJCAI (2022)
Miguel Suau, Jinke He, Matthijs T. J. Spaan, Frans A. Oliehoek
Speeding up Deep Reinforcement Learning through Influence-Augmented Local Simulators.
AAMAS (2022)
Miguel Suau, Jinke He, Matthijs T. J. Spaan, Frans A. Oliehoek
Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep RL in Large Networked Systems.
ICML (2022)
Miguel Suau, Jinke He, Mustafa Mert Çelikok, Matthijs T. J. Spaan, Frans A. Oliehoek
Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems.
CoRR (2022)
Jinke He, Miguel Suau, Hendrik Baier, Michael Kaisers, Frans A. Oliehoek
Online Planning in POMDPs with Self-Improving Simulators.
CoRR (2022)
Miguel Suau, Alexandros Agapitos, David Lynch, Derek Farrell, Mingqi Zhou, Aleksandar Milenovic
Offline Contextual Bandits for Wireless Network Optimization.
CoRR (2021)
Jinke He, Miguel Suau, Frans A. Oliehoek
Influence-Augmented Online Planning for Complex Environments.
CoRR (2020)
Jinke He, Miguel Suau, Frans A. Oliehoek
Influence-Augmented Online Planning for Complex Environments.
NeurIPS (2020)
Miguel Suau, Elena Congeduti, Rolf Starre, Aleksander Czechowski, Frans A. Oliehoek
Influence-aware Memory for Deep Reinforcement Learning.
CoRR (2019)