Sign in

Toward More Accurate and Generalizable Evaluation Metrics for Task-Oriented Dialogs.

Abishek KommaNagesh Panyam ChandrasekarasastryTimothy LeffelAnuj GoyalAngeliki MetallinouSpyros MatsoukasAram Galstyan
Published in: ACL (industry) (2023)
Keyphrases
  • evaluation metrics
  • precision and recall
  • average precision
  • evaluation methods
  • evaluation framework
  • mixed initiative
  • natural language processing
  • learning to rank
  • evaluation measures
  • dialogue system