Login / Signup
The Validity of Evaluation Results: Assessing Concurrence Across Compositionality Benchmarks.
Kaiser Sun
Adina Williams
Dieuwke Hupkes
Published in:
CoNLL (2023)
Keyphrases
</>
empirical evaluation
databases
learning algorithm
information systems
multiresolution
relevance feedback
information retrieval systems
evaluation measures
evaluation criteria
comparative evaluation
evaluation framework
assessment process