Sign in
Reda Ouhamma
Publication Activity (10 Years)
Years Active: 2020-2023
Publications (10 Years): 9
Top Topics
Exponential Family
Bregman Divergences
Kl Divergence
Linear Regression
Top Venues
CoRR
NeurIPS
ICLR
AAAI
</>
Publications
</>
Reda Ouhamma
,
Debabrota Basu
,
Odalric Maillard
Bilinear Exponential Family of MDPs: Frequentist Regret Bound with Tractable Exploration & Planning.
AAAI
(2023)
Reda Ouhamma
,
Maryam Kamgarpour
Learning Nash Equilibria in Zero-Sum Markov Games: A Single Time-scale Algorithm Under Weak Reachability.
CoRR
(2023)
Reda Ouhamma
,
Debabrota Basu
,
Odalric-Ambrym Maillard
Bilinear Exponential Family of MDPs: Frequentist Regret Bound with Tractable Exploration and Planning.
CoRR
(2022)
Yannis Flet-Berliac
,
Reda Ouhamma
,
Odalric-Ambrym Maillard
,
Philippe Preux
Learning Value Functions in Deep Policy Gradients using Residual Variance.
ICLR
(2021)
Reda Ouhamma
,
Odalric-Ambrym Maillard
,
Vianney Perchet
Stochastic Online Linear Regression: the Forward Algorithm to Replace Ridge.
NeurIPS
(2021)
Reda Ouhamma
,
Odalric-Ambrym Maillard
,
Vianney Perchet
Online Sign Identification: Minimization of the Number of Errors in Thresholding Bandits.
NeurIPS
(2021)
Reda Ouhamma
,
Rémy Degenne
,
Pierre Gaillard
,
Vianney Perchet
Online Sign Identification: Minimization of the Number of Errors in Thresholding Bandits.
CoRR
(2021)
Reda Ouhamma
,
Odalric Maillard
,
Vianney Perchet
Stochastic Online Linear Regression: the Forward Algorithm to Replace Ridge.
CoRR
(2021)
Yannis Flet-Berliac
,
Reda Ouhamma
,
Odalric-Ambrym Maillard
,
Philippe Preux
Is Standard Deviation the New Standard? Revisiting the Critic in Deep Policy Gradients.
CoRR
(2020)