Login / Signup
Nouha Dziri
Publication Activity (10 Years)
Years Active: 2018-2024
Publications (10 Years): 47
Top Topics
Iterative Refinement
Dialogue System
Question Answering
Language Model
Top Venues
CoRR
ICLR
NeurIPS
Trans. Assoc. Comput. Linguistics
</>
Publications
</>
Bill Yuchen Lin
,
Yuntian Deng
,
Khyathi Raghavi Chandu
,
Faeze Brahman
,
Abhilasha Ravichander
,
Valentina Pyatkin
,
Nouha Dziri
,
Ronan Le Bras
,
Yejin Choi
WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild.
CoRR
(2024)
Kaitlyn Zhou
,
Jena D. Hwang
,
Xiang Ren
,
Nouha Dziri
,
Dan Jurafsky
,
Maarten Sap
Rel-A.I.: An Interaction-Centered Approach To Measuring Human-LM Reliance.
CoRR
(2024)
Peter West
,
Ximing Lu
,
Nouha Dziri
,
Faeze Brahman
,
Linjie Li
,
Jena D. Hwang
,
Liwei Jiang
,
Jillian Fisher
,
Abhilasha Ravichander
,
Khyathi Raghavi Chandu
,
Benjamin Newman
,
Pang Wei Koh
,
Allyson Ettinger
,
Yejin Choi
The Generative AI Paradox: "What It Can Create, It May Not Understand".
ICLR
(2024)
Liwei Jiang
,
Kavel Rao
,
Seungju Han
,
Allyson Ettinger
,
Faeze Brahman
,
Sachin Kumar
,
Niloofar Mireshghallah
,
Ximing Lu
,
Maarten Sap
,
Yejin Choi
,
Nouha Dziri
WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models.
CoRR
(2024)
Taylor Sorensen
,
Jared Moore
,
Jillian Fisher
,
Mitchell L. Gordon
,
Niloofar Mireshghallah
,
Christopher Michael Rytting
,
Andre Ye
,
Liwei Jiang
,
Ximing Lu
,
Nouha Dziri
,
Tim Althoff
,
Yejin Choi
A Roadmap to Pluralistic Alignment.
CoRR
(2024)
Nathan Lambert
,
Valentina Pyatkin
,
Jacob Morrison
,
LJ Miranda
,
Bill Yuchen Lin
,
Khyathi Raghavi Chandu
,
Nouha Dziri
,
Sachin Kumar
,
Tom Zick
,
Yejin Choi
,
Noah A. Smith
,
Hannaneh Hajishirzi
RewardBench: Evaluating Reward Models for Language Modeling.
CoRR
(2024)
Taylor Sorensen
,
Liwei Jiang
,
Jena D. Hwang
,
Sydney Levine
,
Valentina Pyatkin
,
Peter West
,
Nouha Dziri
,
Ximing Lu
,
Kavel Rao
,
Chandra Bhagavatula
,
Maarten Sap
,
John Tasioulas
,
Yejin Choi
Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties.
AAAI
(2024)
Faeze Brahman
,
Sachin Kumar
,
Vidhisha Balachandran
,
Pradeep Dasigi
,
Valentina Pyatkin
,
Abhilasha Ravichander
,
Sarah Wiegreffe
,
Nouha Dziri
,
Khyathi Raghavi Chandu
,
Jack Hessel
,
Yulia Tsvetkov
,
Noah A. Smith
,
Yejin Choi
,
Hannaneh Hajishirzi
The Art of Saying No: Contextual Noncompliance in Language Models.
CoRR
(2024)
Bill Yuchen Lin
,
Abhilasha Ravichander
,
Ximing Lu
,
Nouha Dziri
,
Melanie Sclar
,
Khyathi Raghavi Chandu
,
Chandra Bhagavatula
,
Yejin Choi
The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning.
ICLR
(2024)
Nico Daheim
,
Nouha Dziri
,
Mrinmaya Sachan
,
Iryna Gurevych
,
Edoardo M. Ponti
Elastic Weight Removal for Faithful and Abstractive Dialogue Generation.
NAACL-HLT
(2024)
Linlu Qiu
,
Liwei Jiang
,
Ximing Lu
,
Melanie Sclar
,
Valentina Pyatkin
,
Chandra Bhagavatula
,
Bailin Wang
,
Yoon Kim
,
Yejin Choi
,
Nouha Dziri
,
Xiang Ren
Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement.
ICLR
(2024)
Seungju Han
,
Kavel Rao
,
Allyson Ettinger
,
Liwei Jiang
,
Bill Yuchen Lin
,
Nathan Lambert
,
Yejin Choi
,
Nouha Dziri
WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs.
CoRR
(2024)
Huihan Li
,
Liwei Jiang
,
Jena D. Huang
,
Hyunwoo Kim
,
Sebastin Santy
,
Taylor Sorensen
,
Bill Yuchen Lin
,
Nouha Dziri
,
Xiang Ren
,
Yejin Choi
CULTURE-GEN: Revealing Global Cultural Perception in Language Models through Natural Language Prompting.
CoRR
(2024)
Nico Daheim
,
Nouha Dziri
,
Mrinmaya Sachan
,
Iryna Gurevych
,
Edoardo M. Ponti
Elastic Weight Removal for Faithful and Abstractive Dialogue Generation.
CoRR
(2023)
Seungju Han
,
Jack Hessel
,
Nouha Dziri
,
Yejin Choi
,
Youngjae Yu
Champagne: Learning Real-world Conversation from Large-Scale Web Videos.
ICCV
(2023)
Taylor Sorensen
,
Liwei Jiang
,
Jena D. Hwang
,
Sydney Levine
,
Valentina Pyatkin
,
Peter West
,
Nouha Dziri
,
Ximing Lu
,
Kavel Rao
,
Chandra Bhagavatula
,
Maarten Sap
,
John Tasioulas
,
Yejin Choi
Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties.
CoRR
(2023)
Ximing Lu
,
Faeze Brahman
,
Peter West
,
Jaehun Jung
,
Khyathi Chandu
,
Abhilasha Ravichander
,
Lianhui Qin
,
Prithviraj Ammanabrolu
,
Liwei Jiang
,
Sahana Ramnath
,
Nouha Dziri
,
Jillian Fisher
,
Bill Yuchen Lin
,
Skyler Hallinan
,
Xiang Ren
,
Sean Welleck
,
Yejin Choi
Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning.
CoRR
(2023)
Nouha Dziri
,
Ximing Lu
,
Melanie Sclar
,
Xiang Lorraine Li
,
Liwei Jiang
,
Bill Yuchen Lin
,
Sean Welleck
,
Peter West
,
Chandra Bhagavatula
,
Ronan Le Bras
,
Jena D. Hwang
,
Soumya Sanyal
,
Xiang Ren
,
Allyson Ettinger
,
Zaïd Harchaoui
,
Yejin Choi
Faith and Fate: Limits of Transformers on Compositionality.
NeurIPS
(2023)
Zeqiu Wu
,
Yushi Hu
,
Weijia Shi
,
Nouha Dziri
,
Alane Suhr
,
Prithviraj Ammanabrolu
,
Noah A. Smith
,
Mari Ostendorf
,
Hannaneh Hajishirzi
Fine-Grained Human Feedback Gives Better Rewards for Language Model Training.
NeurIPS
(2023)
Aman Madaan
,
Niket Tandon
,
Prakhar Gupta
,
Skyler Hallinan
,
Luyu Gao
,
Sarah Wiegreffe
,
Uri Alon
,
Nouha Dziri
,
Shrimai Prabhumoye
,
Yiming Yang
,
Shashank Gupta
,
Bodhisattwa Prasad Majumder
,
Katherine Hermann
,
Sean Welleck
,
Amir Yazdanbakhsh
,
Peter Clark
Self-Refine: Iterative Refinement with Self-Feedback.
NeurIPS
(2023)
Ehsan Kamalloo
,
Nouha Dziri
,
Charles L. A. Clarke
,
Davood Rafiei
Evaluating Open-Domain Question Answering in the Era of Large Language Models.
CoRR
(2023)
Ximing Lu
,
Faeze Brahman
,
Peter West
,
Jaehun Jung
,
Khyathi Chandu
,
Abhilasha Ravichander
,
Prithviraj Ammanabrolu
,
Liwei Jiang
,
Sahana Ramnath
,
Nouha Dziri
,
Jillian Fisher
,
Bill Lin
,
Skyler Hallinan
,
Lianhui Qin
,
Xiang Ren
,
Sean Welleck
,
Yejin Choi
Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning.
EMNLP
(2023)
Seungju Han
,
Jack Hessel
,
Nouha Dziri
,
Yejin Choi
,
Youngjae Yu
CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos.
CoRR
(2023)
Aman Madaan
,
Niket Tandon
,
Prakhar Gupta
,
Skyler Hallinan
,
Luyu Gao
,
Sarah Wiegreffe
,
Uri Alon
,
Nouha Dziri
,
Shrimai Prabhumoye
,
Yiming Yang
,
Sean Welleck
,
Bodhisattwa Prasad Majumder
,
Shashank Gupta
,
Amir Yazdanbakhsh
,
Peter Clark
Self-Refine: Iterative Refinement with Self-Feedback.
CoRR
(2023)
Kavel Rao
,
Liwei Jiang
,
Valentina Pyatkin
,
Yuling Gu
,
Niket Tandon
,
Nouha Dziri
,
Faeze Brahman
,
Yejin Choi
What Makes it Ok to Set a Fire? Iterative Self-distillation of Contexts and Rationales for Disambiguating Defeasible Social and Moral Situations.
CoRR
(2023)
Zeqiu Wu
,
Yushi Hu
,
Weijia Shi
,
Nouha Dziri
,
Alane Suhr
,
Prithviraj Ammanabrolu
,
Noah A. Smith
,
Mari Ostendorf
,
Hannaneh Hajishirzi
Fine-Grained Human Feedback Gives Better Rewards for Language Model Training.
CoRR
(2023)
Nouha Dziri
,
Ximing Lu
,
Melanie Sclar
,
Xiang Lorraine Li
,
Liwei Jiang
,
Bill Yuchen Lin
,
Peter West
,
Chandra Bhagavatula
,
Ronan Le Bras
,
Jena D. Hwang
,
Soumya Sanyal
,
Sean Welleck
,
Xiang Ren
,
Allyson Ettinger
,
Zaïd Harchaoui
,
Yejin Choi
Faith and Fate: Limits of Transformers on Compositionality.
CoRR
(2023)
Linlu Qiu
,
Liwei Jiang
,
Ximing Lu
,
Melanie Sclar
,
Valentina Pyatkin
,
Chandra Bhagavatula
,
Bailin Wang
,
Yoon Kim
,
Yejin Choi
,
Nouha Dziri
,
Xiang Ren
Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement.
CoRR
(2023)
Kavel Rao
,
Liwei Jiang
,
Valentina Pyatkin
,
Yuling Gu
,
Niket Tandon
,
Nouha Dziri
,
Faeze Brahman
,
Yejin Choi
What Makes it Ok to Set a Fire? Iterative Self-distillation of Contexts and Rationales for Disambiguating Defeasible Social and Moral Situations.
EMNLP (Findings)
(2023)
Peter West
,
Ximing Lu
,
Nouha Dziri
,
Faeze Brahman
,
Linjie Li
,
Jena D. Hwang
,
Liwei Jiang
,
Jillian Fisher
,
Abhilasha Ravichander
,
Khyathi Chandu
,
Benjamin Newman
,
Pang Wei Koh
,
Allyson Ettinger
,
Yejin Choi
The Generative AI Paradox: "What It Can Create, It May Not Understand".
CoRR
(2023)
Bill Yuchen Lin
,
Abhilasha Ravichander
,
Ximing Lu
,
Nouha Dziri
,
Melanie Sclar
,
Khyathi Chandu
,
Chandra Bhagavatula
,
Yejin Choi
The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning.
CoRR
(2023)
Ehsan Kamalloo
,
Nouha Dziri
,
Charles L. A. Clarke
,
Davood Rafiei
Evaluating Open-Domain Question Answering in the Era of Large Language Models.
ACL (1)
(2023)
Nouha Dziri
,
Ehsan Kamalloo
,
Sivan Milton
,
Osmar Zaïane
,
Mo Yu
,
Edoardo Maria Ponti
,
Siva Reddy
FaithDial: A Faithful Benchmark for Information-Seeking Dialogue.
CoRR
(2022)
Nouha Dziri
,
Hannah Rashkin
,
Tal Linzen
,
David Reitter
Evaluating Attribution in Dialogue Systems: The BEGIN Benchmark.
Trans. Assoc. Comput. Linguistics
10 (2022)
Nouha Dziri
,
Sivan Milton
,
Mo Yu
,
Osmar Zaïane
,
Siva Reddy
On the Origin of Hallucinations in Conversational Models: Is it the Datasets or the Models?
CoRR
(2022)
Nouha Dziri
,
Ehsan Kamalloo
,
Sivan Milton
,
Osmar R. Zaïane
,
Mo Yu
,
Edoardo M. Ponti
,
Siva Reddy
FaithDial: A Faithful Benchmark for Information-Seeking Dialogue.
Trans. Assoc. Comput. Linguistics
10 (2022)
Nouha Dziri
,
Sivan Milton
,
Mo Yu
,
Osmar R. Zaïane
,
Siva Reddy
On the Origin of Hallucinations in Conversational Models: Is it the Datasets or the Models?
NAACL-HLT
(2022)
Alessandro Sordoni
,
Nouha Dziri
,
Hannes Schulz
,
Geoffrey J. Gordon
,
Philip Bachman
,
Remi Tachet des Combes
Decomposed Mutual Information Estimation for Contrastive Representation Learning.
ICML
(2021)
Alessandro Sordoni
,
Nouha Dziri
,
Hannes Schulz
,
Geoffrey J. Gordon
,
Philip Bachman
,
Remi Tachet des Combes
Decomposed Mutual Information Estimation for Contrastive Representation Learning.
CoRR
(2021)
Nouha Dziri
,
Hannah Rashkin
,
Tal Linzen
,
David Reitter
Evaluating Groundedness in Dialogue Systems: The BEGIN Benchmark.
CoRR
(2021)
Nouha Dziri
,
Andrea Madotto
,
Osmar Zaïane
,
Avishek Joey Bose
Neural Path Hunter: Reducing Hallucination in Dialogue Systems via Path Grounding.
EMNLP (1)
(2021)
Nouha Dziri
,
Andrea Madotto
,
Osmar R. Zaïane
,
Avishek Joey Bose
Neural Path Hunter: Reducing Hallucination in Dialogue Systems via Path Grounding.
CoRR
(2021)
Nouha Dziri
,
Ehsan Kamalloo
,
Kory W. Mathewson
,
Osmar R. Zaïane
Evaluating Coherence in Dialogue Systems using Entailment.
CoRR
(2019)
Nouha Dziri
,
Ehsan Kamalloo
,
Kory W. Mathewson
,
Osmar R. Zaïane
Evaluating Coherence in Dialogue Systems using Entailment.
NAACL-HLT (1)
(2019)
Nouha Dziri
,
Ehsan Kamalloo
,
Kory W. Mathewson
,
Osmar R. Zaïane
Evaluating Coherence in Dialogue Systems using Entailment.
WNLP@ACL
(2019)
Nouha Dziri
,
Ehsan Kamalloo
,
Kory W. Mathewson
,
Osmar R. Zaïane
Augmenting Neural Response Generation with Context-Aware Topical Attention.
CoRR
(2018)
Chenyang Huang
,
Osmar R. Zaïane
,
Amine Trabelsi
,
Nouha Dziri
Automatic Dialogue Generation with Expressed Emotions.
NAACL-HLT (2)
(2018)