​
Login / Signup
Lu Lin
ORCID
Publication Activity (10 Years)
Years Active: 2024-2024
Publications (10 Years): 7
Top Topics
Language Modelling
Document Ranking
Top Venues
CoRR
ACL (1)
ICLR
</>
Publications
</>
Bochuan Cao
,
Yuanpu Cao
,
Lu Lin
,
Jinghui Chen
Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM.
ACL (1)
(2024)
Tianrong Zhang
,
Bochuan Cao
,
Yuanpu Cao
,
Lu Lin
,
Prasenjit Mitra
,
Jinghui Chen
WordGame: Efficient & Effective LLM Jailbreak via Simultaneous Obfuscation in Query and Response.
CoRR
(2024)
Yuanpu Cao
,
Tianrong Zhang
,
Bochuan Cao
,
Ziyi Yin
,
Lu Lin
,
Fenglong Ma
,
Jinghui Chen
Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization.
CoRR
(2024)
Yurui Chang
,
Bochuan Cao
,
Yujia Wang
,
Jinghui Chen
,
Lu Lin
XPrompt:Explaining Large Language Model's Generation via Joint Prompt Attribution.
CoRR
(2024)
Hangfan Zhang
,
Zhimeng Guo
,
Huaisheng Zhu
,
Bochuan Cao
,
Lu Lin
,
Jinyuan Jia
,
Jinghui Chen
,
Dinghao Wu
Jailbreak Open-Sourced Large Language Models via Enforced Decoding.
ACL (1)
(2024)
Weiyu Sun
,
Xinyu Zhang
,
Hao Lu
,
Yingcong Chen
,
Ting Wang
,
Jinghui Chen
,
Lu Lin
Backdoor Contrastive Learning via Bi-level Trigger Optimization.
CoRR
(2024)
Weiyu Sun
,
Xinyu Zhang
,
Hao Lu
,
Ying-Cong Chen
,
Ting Wang
,
Jinghui Chen
,
Lu Lin
Backdoor Contrastive Learning via Bi-level Trigger Optimization.
ICLR
(2024)