DICES Dataset: Diversity in Conversational AI Evaluation for Safety.
Lora Aroyo
Alex S. Taylor
Mark Diaz
Christopher M. Homan
Alicia Parrish
Greg Serapio-Garcia
Vinodkumar Prabhakaran
Ding Wang
Published in:
CoRR (2023)
Keyphrases
artificial intelligence
machine learning
evaluation model
real world
expert systems
benchmark datasets
evaluation metrics
decision trees
natural language
real life
knowledge representation
intelligent systems
multi modal
evaluation method
evaluation criteria