SART - Similarity, Analogies, and Relatedness for Tatar Language: New Benchmark Datasets for Word Embeddings Evaluation.
Albina KhusainovaAdil KhanAdín Ramírez RiveraPublished in: CoRR (2019)
Keyphrases
- benchmark datasets
- co occurrence
- ensemble methods
- evaluation measures
- semantic similarity
- word similarity
- uci repository
- distance measure
- similarity measure
- uci machine learning repository
- natural language
- terms of classification accuracy
- information extraction
- keywords
- feature selection
- machine translation system
- english text
- lexical information
- information retrieval