RUBICON: Rubric-Based Evaluation of Domain-Specific Human AI Conversations.
Param Biyani, Yasharth Bajpai, Arjun Radhakrishna, Gustavo Soares, Sumit Gulwani
Published in: AIware (2024)
Keyphrases
- domain specific
- artificial intelligence
- general purpose
- domain independent
- machine learning
- assessment tool
- human communication
- evaluation methods
- expert systems
- ai technologies
- human intelligence
- experimental design
- evaluation criteria
- web intelligence
- evaluation metrics
- human behavior
- computational models
- human experts
- domain experts
- collaborative learning
- case based reasoning