A Proposal for Linguistic Similarity Datasets Based on Commonality Lists.
Dmitrijs MilajevsSascha S. GriffithsPublished in: RepEval@ACL (2016)
Keyphrases
- similarity measure
- distance measure
- uci machine learning repository
- semantic similarity
- natural language
- benchmark datasets
- distance function
- database
- euclidean distance
- document similarity
- linguistic knowledge
- synthetic and real datasets
- similarity metrics
- similarity measurement
- user defined
- higher level
- natural language processing
- social networks