The price of debiasing automatic metrics in natural language evaluation.
Arun Tejasvi Chaganty, Stephen Mussmann, Percy Liang
Published in: CoRR (2018)
Keyphrases
- natural language
- evaluation methods
- evaluation metrics
- semi automatic
- evaluation criteria
- automatic evaluation
- natural language processing
- fully automatic
- evaluation method
- machine learning
- evolutionary algorithm
- evaluation model
- evaluation methodology
- knowledge representation
- question answering
- comparative evaluation
- similarity metrics
- natural language interface