C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Omid Saremi
ORCID
Publication Activity (10 Years)
Years Active: 2015-2023
Publications (10 Years): 9
Top Topics
Learning Tasks
Robust Image Watermarking
Classifier Systems
Function Approximators
Top Venues
CoRR
Complex Adaptive Systems
</>
Publications
</>
Noam Razin
,
Hattie Zhou
,
Omid Saremi
,
Vimal Thilak
,
Arwen Bradley
,
Preetum Nakkiran
,
Joshua M. Susskind
,
Etai Littwin
Vanishing Gradients in Reinforcement Finetuning of Language Models.
CoRR
(2023)
Enric Boix-Adserà
,
Omid Saremi
,
Emmanuel Abbe
,
Samy Bengio
,
Etai Littwin
,
Joshua M. Susskind
When can transformers reason with abstract symbols?
CoRR
(2023)
Vimal Thilak
,
Chen Huang
,
Omid Saremi
,
Laurent Dinh
,
Hanlin Goh
,
Preetum Nakkiran
,
Joshua M. Susskind
,
Etai Littwin
LiDAR: Sensing Linear Probing Performance in Joint Embedding SSL Architectures.
CoRR
(2023)
Hattie Zhou
,
Arwen Bradley
,
Etai Littwin
,
Noam Razin
,
Omid Saremi
,
Josh M. Susskind
,
Samy Bengio
,
Preetum Nakkiran
What Algorithms can Transformers Learn? A Study in Length Generalization.
CoRR
(2023)
Samira Abnar
,
Omid Saremi
,
Laurent Dinh
,
Shantel Wilson
,
Miguel Ángel Bautista
,
Chen Huang
,
Vimal Thilak
,
Etai Littwin
,
Jiatao Gu
,
Josh M. Susskind
,
Samy Bengio
Adaptivity and Modularity for Efficient Generalization Over Task Complexity.
CoRR
(2023)
Vimal Thilak
,
Etai Littwin
,
Shuangfei Zhai
,
Omid Saremi
,
Roni Paiss
,
Joshua M. Susskind
The Slingshot Mechanism: An Empirical Study of Adaptive Optimizers and the Grokking Phenomenon.
CoRR
(2022)
Etai Littwin
,
Omid Saremi
,
Shuangfei Zhai
,
Vimal Thilak
,
Hanlin Goh
,
Joshua M. Susskind
,
Greg Yang
Implicit Acceleration and Feature Learning in Infinitely Wide Neural Networks with Bottlenecks.
CoRR
(2021)
Shih-Yu Sun
,
Vimal Thilak
,
Etai Littwin
,
Omid Saremi
,
Joshua M. Susskind
Implicit Greedy Rank Learning in Autoencoders via Overparameterized Linear Networks.
CoRR
(2021)
Omid Saremi
,
Masoud Shariat Panahi
,
Amin Sabzehzar
An Improved Continuous-Action Extended Classifier Systems for Function Approximation.
Complex Adaptive Systems
(2015)