Login / Signup
Junxiao Yang
Publication Activity (10 Years)
Years Active: 2023-2024
Publications (10 Years): 3
Top Topics
Document Length
Language Models For Information Retrieval
Ddos Attacks
N Gram
Top Venues
CoRR
ACL (1)
</>
Publications
</>
Zhexin Zhang
,
Junxiao Yang
,
Pei Ke
,
Fei Mi
,
Hongning Wang
,
Minlie Huang
Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization.
ACL (1)
(2024)
Zhexin Zhang
,
Junxiao Yang
,
Pei Ke
,
Shiyao Cui
,
Chujie Zheng
,
Hongning Wang
,
Minlie Huang
Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks.
CoRR
(2024)
Zhexin Zhang
,
Junxiao Yang
,
Pei Ke
,
Minlie Huang
Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization.
CoRR
(2023)