Login / Signup
Rusheb Shah
Publication Activity (10 Years)
Years Active: 2023-2023
Publications (10 Years): 4
Top Topics
Combinatorial Optimization
Language Model
Black Boxes
Structured Representations
Top Venues
CoRR
UniReps
</>
Publications
</>
Rusheb Shah
,
Quentin Feuillade-Montixi
,
Soroush Pour
,
Arush Tagade
,
Stephen Casper
,
Javier Rando
Scalable and Transferable Black-Box Jailbreaks for Language Models via Persona Modulation.
CoRR
(2023)
Michael Igorevich Ivanitskiy
,
Alex F. Spies
,
Tilman Räuker
,
Guillaume Corlouer
,
Chris Mathwin
,
Lucia Quirke
,
Can Rager
,
Rusheb Shah
,
Dan Valentine
,
Cecilia G. Diniz Behn
,
Katsumi Inoue
,
Samy Wu Fung
Structured World Representations in Maze-Solving Transformers.
CoRR
(2023)
Michael I. Ivanitskiy
,
Alex F. Spies
,
Tilman Räuker
,
Guillaume Corlouer
,
Chris Mathwin
,
Lucia Quirke
,
Can Rager
,
Rusheb Shah
,
Dan Valentine
,
Cecilia G. Diniz Behn
,
Katsumi Inoue
,
Samy Wu Fung
Linearly Structured World Representations in Maze-Solving Transformers.
UniReps
(2023)
Michael Igorevich Ivanitskiy
,
Rusheb Shah
,
Alex F. Spies
,
Tilman Räuker
,
Dan Valentine
,
Can Rager
,
Lucia Quirke
,
Chris Mathwin
,
Guillaume Corlouer
,
Cecilia G. Diniz Behn
,
Samy Wu Fung
A Configurable Library for Generating and Manipulating Maze Datasets.
CoRR
(2023)