Tangled up in BLEU: Reevaluating the Evaluation of Automatic Machine Translation Evaluation Metrics.
Nitika Mathur, Timothy Baldwin, Trevor Cohn
Published in: ACL (2020)
Keyphrases
- machine translation
- evaluation metrics
- lexical cohesion
- precision and recall
- average precision
- information extraction
- natural language processing
- cross lingual
- target language
- MT evaluation
- language independent
- evaluation measures
- learning to rank
- statistical machine translation
- machine translation system
- word alignment
- Chinese-English
- natural language
- relevance judgments
- cross language information retrieval
- parallel corpora
- query translation
- benchmark datasets
- collaborative filtering
- semi supervised