Rosie Zhao
Publication Activity (10 Years)
Years Active: 2020-2024
Publications (10 Years): 15
Top Topics
Reinforcement Learning
Lower Bound
Online Learning
Policy Gradient
Top Venues
CoRR
Electron. Colloquium Comput. Complex.
APPROX/RANDOM
Discret. Math. Theor. Comput. Sci.
Publications
Rosie Zhao, Depen Morwani, David Brandfonbrener, Nikhil Vyas, Sham M. Kakade
Deconstructing What Makes a Good Optimizer for Language Models.
CoRR
(2024)
Depen Morwani, Benjamin L. Edelman, Costin-Andrei Oncescu, Rosie Zhao, Sham M. Kakade
Feature emergence via margin maximization: case studies in algebraic tasks.
ICLR
(2024)
Zaheer Abbas, Rosie Zhao, Joseph Modayil, Adam White, Marlos C. Machado
Loss of Plasticity in Continual Deep Reinforcement Learning.
CoRR
(2023)
Nikhil Vyas, Depen Morwani, Rosie Zhao, Gal Kaplun, Sham M. Kakade, Boaz Barak
Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning.
CoRR
(2023)
Prakash Panangaden, Sahand Rezaei-Shoshtari, Rosie Zhao, David Meger, Doina Precup
Policy Gradient Methods in the Presence of Symmetries and State Abstractions.
CoRR
(2023)
Depen Morwani, Benjamin L. Edelman, Costin-Andrei Oncescu, Rosie Zhao, Sham M. Kakade
Feature emergence via margin maximization: case studies in algebraic tasks.
CoRR
(2023)
Zaheer Abbas, Rosie Zhao, Joseph Modayil, Adam White, Marlos C. Machado
Loss of Plasticity in Continual Deep Reinforcement Learning.
CoLLAs
(2023)
Hamed Hatami, Pooya Hatami, William Pires, Ran Tao, Rosie Zhao
Lower Bound Methods for Sign-Rank and Their Limitations.
APPROX/RANDOM
(2022)
Sahand Rezaei-Shoshtari, Rosie Zhao, Prakash Panangaden, David Meger, Doina Precup
Continuous MDP Homomorphisms and Homomorphic Policy Gradient.
NeurIPS
(2022)
Hamed Hatami, Pooya Hatami, William Pires, Ran Tao, Rosie Zhao
Lower Bound Methods for Sign-rank and their Limitations.
Electron. Colloquium Comput. Complex.
(2022)
Anna M. Brandenberger, Luc Devroye, Marcel K. Goh, Rosie Zhao
Leaf multiplicity in a Bienaymé-Galton-Watson tree.
Discret. Math. Theor. Comput. Sci.
24 (1) (2022)
TsunMing Cheung, Hamed Hatami, Rosie Zhao, Itai Zilberstein
Boolean functions with small approximate spectral norm.
Electron. Colloquium Comput. Complex.
(2022)
Sahand Rezaei-Shoshtari, Rosie Zhao, Prakash Panangaden, David Meger, Doina Precup
Continuous MDP Homomorphisms and Homomorphic Policy Gradient.
CoRR
(2022)
Mikael Brunila, Rosie Zhao, Andrei Mircea, Sam Lumley, Renée Sieber
Bridging the gap between supervised classification and unsupervised topic modelling for social-media assisted crisis management.
CoRR
(2021)
Gavin McCracken, Colin Daniels, Rosie Zhao, Anna Brandenberger, Prakash Panangaden, Doina Precup
A Study of Policy Gradient on a Class of Exactly Solvable Models.
CoRR
(2020)