Login / Signup
Wenxuan Wang
ORCID
Publication Activity (10 Years)
Years Active: 2024-2024
Publications (10 Years): 4
Top Topics
Cross Lingual Information Retrieval
Cultural Dimensions
Multi Lingual
Language Model
Top Venues
ACL (Findings)
ACL (1)
CoRR
ICLR
</>
Publications
</>
Youliang Yuan
,
Wenxiang Jiao
,
Wenxuan Wang
,
Jen-tse Huang
,
Jiahao Xu
,
Tian Liang
,
Pinjia He
,
Zhaopeng Tu
Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training.
CoRR
(2024)
Youliang Yuan
,
Wenxiang Jiao
,
Wenxuan Wang
,
Jen-tse Huang
,
Pinjia He
,
Shuming Shi
,
Zhaopeng Tu
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher.
ICLR
(2024)
Wenxuan Wang
,
Zhaopeng Tu
,
Chang Chen
,
Youliang Yuan
,
Jen-tse Huang
,
Wenxiang Jiao
,
Michael R. Lyu
All Languages Matter: On the Multilingual Safety of LLMs.
ACL (Findings)
(2024)
Wenxuan Wang
,
Wenxiang Jiao
,
Jingyuan Huang
,
Ruyi Dai
,
Jen-tse Huang
,
Zhaopeng Tu
,
Michael R. Lyu
Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models.
ACL (1)
(2024)