Toward an enhanced Arabic text classification using cosine similarity and Latent Semantic Indexing.
Fawaz S. Al-AnziDia AbuZeinaPublished in: J. King Saud Univ. Comput. Inf. Sci. (2017)
Keyphrases
- latent semantic indexing
- cosine similarity
- text classification
- vector space
- vector space model
- document representation
- document clustering
- bag of words
- text documents
- text mining
- information retrieval
- tf idf
- n gram
- text categorization
- retrieval model
- distance measure
- feature selection
- similarity measure
- labeled data
- text retrieval
- language model
- knn
- semantic similarity
- web documents
- machine learning
- feature vectors
- text data
- similarity search
- singular value decomposition
- similarity function
- pairwise
- semantic features