Xiao Wang
Publication Activity (10 Years)
Years Active: 2024-2024
Publications (10 Years): 3
Top Topics
Disparity Estimation
Language Models For Information Retrieval
Language Model
Chosen Plaintext
Top Venues
CoRR
ACL (1)
Publications
Chenyu Shi, Xiao Wang, Qiming Ge, Songyang Gao, Xianjun Yang, Tao Gui, Qi Zhang, Xuanjing Huang, Xun Zhao, Dahua Lin
Navigating the OverKill in Large Language Models.
ACL (1)
(2024)
Caishuang Huang, Wanxu Zhao, Rui Zheng, Huijie Lv, Shihan Dou, Sixian Li, Xiao Wang, Enyu Zhou, Junjie Ye, Yuming Yang, Tao Gui, Qi Zhang, Xuanjing Huang
SafeAligner: Safety Alignment against Jailbreak Attacks via Response Disparity Guidance.
CoRR
(2024)
Xiao Wang, Tianze Chen, Xianjun Yang, Qi Zhang, Xun Zhao, Dahua Lin
Unveiling the Misuse Potential of Base Large Language Models via In-Context Learning.
CoRR
(2024)