Sign in
Rusheb Shah
Publication Activity (10 Years)
Years Active: 2023-2023
Publications (10 Years): 3
Top Topics
Language Model
Uci Repository
Language Models For Information Retrieval
Black Boxes
Top Venues
CoRR
</>
Publications
</>
Rusheb Shah
,
Quentin Feuillade-Montixi
,
Soroush Pour
,
Arush Tagade
,
Stephen Casper
,
Javier Rando
Scalable and Transferable Black-Box Jailbreaks for Language Models via Persona Modulation.
CoRR
(2023)
Michael Igorevich Ivanitskiy
,
Alex F. Spies
,
Tilman Räuker
,
Guillaume Corlouer
,
Chris Mathwin
,
Lucia Quirke
,
Can Rager
,
Rusheb Shah
,
Dan Valentine
,
Cecilia G. Diniz Behn
,
Katsumi Inoue
,
Samy Wu Fung
Structured World Representations in Maze-Solving Transformers.
CoRR
(2023)
Michael Igorevich Ivanitskiy
,
Rusheb Shah
,
Alex F. Spies
,
Tilman Räuker
,
Dan Valentine
,
Can Rager
,
Lucia Quirke
,
Chris Mathwin
,
Guillaume Corlouer
,
Cecilia G. Diniz Behn
,
Samy Wu Fung
A Configurable Library for Generating and Manipulating Maze Datasets.
CoRR
(2023)