Login / Signup
DICES Dataset: Diversity in Conversational AI Evaluation for Safety.
Lora Aroyo
Alex S. Taylor
Mark Díaz
Christopher Homan
Alicia Parrish
Gregory Serapio-García
Vinodkumar Prabhakaran
Ding Wang
Published in:
NeurIPS (2023)
Keyphrases
</>
artificial intelligence
multi modal
database
expert systems
intelligent systems
evaluation model
rule interestingness measures
neural network
real world
knowledge base
recommender systems
case based reasoning
information retrieval systems
test collection
gold standard
intelligent behavior