Login / Signup
Rongwu Xu
Publication Activity (10 Years)
Years Active: 2022-2024
Publications (10 Years): 15
Top Topics
Language Model
Hidden States
Top Venues
CoRR
EuroS&P
ACL (Findings)
CHIIR
</>
Publications
</>
Rongwu Xu
,
Zehan Qi
,
Wei Xu
Preemptive Answer "Attacks" on Chain-of-Thought Reasoning.
CoRR
(2024)
Rongwu Xu
,
Yishuo Cai
,
Zhenhong Zhou
,
Renjie Gu
,
Haiqin Weng
,
Yan Liu
,
Tianwei Zhang
,
Wei Xu
,
Han Qiu
Course-Correction: Safety Alignment Using Synthetic Preferences.
CoRR
(2024)
Rongwu Xu
,
Zehan Qi
,
Cunxiang Wang
,
Hongru Wang
,
Yue Zhang
,
Wei Xu
Knowledge Conflicts for LLMs: A Survey.
CoRR
(2024)
Zhongshen Zeng
,
Yinhong Liu
,
Yingjia Wan
,
Jingyao Li
,
Pengguang Chen
,
Jianbo Dai
,
Yuxuan Yao
,
Rongwu Xu
,
Zehan Qi
,
Wanru Zhao
,
Linling Shen
,
Jianqiao Lu
,
Haochen Tan
,
Yukang Chen
,
Hao Zhang
,
Zhan Shi
,
Bailin Wang
,
Zhijiang Guo
,
Jiaya Jia
MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models.
CoRR
(2024)
Rongwu Xu
,
Zehan Qi
,
Wei Xu
Preemptive Answer "Attacks" on Chain-of-Thought Reasoning.
ACL (Findings)
(2024)
Zhenhong Zhou
,
Haiyang Yu
,
Xinghua Zhang
,
Rongwu Xu
,
Fei Huang
,
Yongbin Li
How Alignment and Jailbreak Work: Explain LLM Safety through Intermediate Hidden States.
CoRR
(2024)
Rongwu Xu
Exploring Chinese Humor Generation: A Study on Two-Part Allegorical Sayings.
CoRR
(2024)
Rongwu Xu
,
Zi'an Zhou
,
Tianwei Zhang
,
Zehan Qi
,
Su Yao
,
Ke Xu
,
Wei Xu
,
Han Qiu
Walking in Others' Shoes: How Perspective-Taking Guides Large Language Models in Reducing Toxicity and Bias.
CoRR
(2024)
Rongwu Xu
,
Zhixuan Fang
Tempo: Confidentiality Preservation in Cloud-Based Neural Network Training.
CoRR
(2024)
Rongwu Xu
,
Brian S. Lin
,
Shujian Yang
,
Tianqi Zhang
,
Weiyan Shi
,
Tianwei Zhang
,
Zhixuan Fang
,
Wei Xu
,
Han Qiu
The Earth is Flat because...: Investigating LLMs' Belief towards Misinformation via Persuasive Conversation.
CoRR
(2023)
Rongwu Xu
,
Sen Yang
,
Fan Zhang
,
Zhixuan Fang
MISO: Legacy-compatible Privacy-preserving Single Sign-on using Trusted Execution Environments.
CoRR
(2023)
Rongwu Xu
,
Sen Yang
,
Fan Zhang
,
Zhixuan Fang
MISO: Legacy-compatible Privacy-preserving Single Sign-on using Trusted Execution Environments.
EuroS&P
(2023)
Xiufeng Huang
,
Rongwu Xu
,
Wenjing Yu
,
Tao Peng
Research on structural sound source localization method by neural network.
EURASIP J. Adv. Signal Process.
2023 (1) (2023)
Yifan Xu
,
Fan Dang
,
Rongwu Xu
,
Xinlei Chen
,
Yunhao Liu
LSync: A Universal Event-synchronizing Solution for Live Streaming.
INFOCOM
(2022)
Jiayu Li
,
Hantian Zhang
,
Zhiyu He
,
Rongwu Xu
,
Pingfei Wu
,
Min Zhang
,
Yiqun Liu
,
Shaoping Ma
LifeRec: A Mobile App for Lifelog Recording and Ubiquitous Recommendation.
CHIIR
(2022)