Hainiu Xu
Publication Activity (10 Years)
Years Active: 2023-2024
Publications (10 Years): 16
Top Topics
Multiple Models
Causal Reasoning
N Gram
Language Modelling
Top Venues
CoRR
ACL (1)
ACL (Findings)
EACL (1)
Publications
Li Zhang, Hainiu Xu, Abhinav Kommula, Chris Callison-Burch, Niket Tandon
OpenPI2.0: An Improved Dataset for Entity Tracking in Texts.
EACL (1)
(2024)
Hainiu Xu, Runcong Zhao, Lixing Zhu, Jinhua Du, Yulan He
OpenToM: A Comprehensive Benchmark for Evaluating Theory-of-Mind Reasoning Capabilities of Large Language Models.
ACL (1)
(2024)
Xinyu Wang, Hainiu Xu, Lin Gui, Yulan He
Towards Unified Task Embeddings Across Multiple Models: Bridging the Gap for Prompt-Based Large Language Models and Beyond.
ACL (Findings)
(2024)
Jiazheng Li, Hainiu Xu, Zhaoyue Sun, Yuxiang Zhou, David West, Cesare Aloisi, Yulan He
Calibrating LLMs with Preference Optimization on Thought Trees for Generating Rationale in Science Question Scoring.
CoRR
(2024)
Xinyu Wang, Hainiu Xu, Lin Gui, Yulan He
Towards Unified Task Embeddings Across Multiple Models: Bridging the Gap for Prompt-Based Large Language Models and Beyond.
CoRR
(2024)
Liam Dugan, Alyssa Hwang, Filip Trhlík, Josh Magnus Ludan, Andrew Zhu, Hainiu Xu, Daphne Ippolito, Chris Callison-Burch
RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors.
CoRR
(2024)
Runcong Zhao, Qinglin Zhu, Hainiu Xu, Jiazheng Li, Yuxiang Zhou, Yulan He, Lin Gui
Large Language Models Fall Short: Understanding Complex Relationships in Detective Narratives.
CoRR
(2024)
Liam Dugan, Alyssa Hwang, Filip Trhlík, Andrew Zhu, Josh Magnus Ludan, Hainiu Xu, Daphne Ippolito, Chris Callison-Burch
RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors.
ACL (1)
(2024)
Runcong Zhao, Qinglin Zhu, Hainiu Xu, Jiazheng Li, Yuxiang Zhou, Yulan He, Lin Gui
Large Language Models Fall Short: Understanding Complex Relationships in Detective Narratives.
ACL (Findings)
(2024)
Hainiu Xu, Runcong Zhao, Lixing Zhu, Jinhua Du, Yulan He
OpenToM: A Comprehensive Benchmark for Evaluating Theory-of-Mind Reasoning Capabilities of Large Language Models.
CoRR
(2024)
Li Zhang, Liam Dugan, Hainiu Xu, Chris Callison-Burch
Exploring the Curious Case of Code Prompts.
CoRR
(2023)
Tianyi Zhang, Isaac Tham, Zhaoyi Hou, Jiaxuan Ren, Liyang Zhou, Hainiu Xu, Li Zhang, Lara J. Martin, Rotem Dror, Sha Li, Heng Ji, Martha Palmer, Susan Windisch Brown, Reece Suchocki, Chris Callison-Burch
Human-in-the-Loop Schema Induction.
CoRR
(2023)
Tianyi Zhang, Isaac Tham, Zhaoyi Hou, Jiaxuan Ren, Leon Zhou, Hainiu Xu, Li Zhang, Lara J. Martin, Rotem Dror, Sha Li, Heng Ji, Martha Palmer, Susan Windisch Brown, Reece Suchocki, Chris Callison-Burch
Human-in-the-loop Schema Induction.
ACL (demo)
(2023)
Li Zhang, Hainiu Xu, Yue Yang, Shuyan Zhou, Weiqiu You, Manni Arora, Chris Callison-Burch
Causal Reasoning of Entities and Events in Procedural Texts.
CoRR
(2023)
Li Zhang, Hainiu Xu, Abhinav Kommula, Niket Tandon, Chris Callison-Burch
OpenPI2.0: An Improved Dataset for Entity Tracking in Texts.
CoRR
(2023)
Li Zhang, Hainiu Xu, Yue Yang, Shuyan Zhou, Weiqiu You, Manni Arora, Chris Callison-Burch
Causal Reasoning of Entities and Events in Procedural Texts.
EACL (Findings)
(2023)