RUBICON: Rubric-Based Evaluation of Domain-Specific Human AI Conversations.

Param Biyani Yasharth Bajpai Arjun Radhakrishna Gustavo Soares Sumit Gulwani

Published in: AIware (2024)

Keyphrases

domain specific
artificial intelligence
general purpose
domain independent
machine learning
assessment tool
human communication
evaluation methods
expert systems
ai technologies
human intelligence
experimental design
evaluation criteria
web intelligence
evaluation metrics
human behavior
computational models
human experts
domain experts
collaborative learning
case based reasoning