Overview of Robust and Multilingual Automatic Evaluation Metrics for Open-Domain Dialogue Systems at DSTC 11 Track 4.
Mario Rodríguez-CantelarChen ZhangChengguang TangKe ShiSarik GhazarianJoão SedocLuis Fernando D'HaroAlexander RudnickyPublished in: CoRR (2023)
Keyphrases
- machine learning
- evaluation metrics
- open domain
- dialogue system
- information extraction
- precision and recall
- natural language
- spoken dialogue systems
- evaluation measures
- question answering
- learning to rank
- reinforcement learning
- document collections
- collaborative filtering
- semi supervised
- question answering systems
- digital libraries