Improving Persian Text Classification and Clustering Using Persian Thesaurus.
Hamid ParvinAtousa DahbashiSajad ParvinBehrouz Minaei-BidgoliPublished in: DCAI (2012)
Keyphrases
- text classification
- unsupervised learning
- naive bayes
- text categorization
- feature selection
- bag of words
- clustering algorithm
- topic discovery
- text mining
- machine learning
- n gram
- text documents
- text data
- information retrieval systems
- text classifiers
- data cleaning
- distributional clustering
- labeled data
- fuzzy clustering
- domain specific
- document clustering
- knn
- multi label
- k means
- spectral clustering
- cluster analysis
- information theoretic
- data points
- clustering method
- information retrieval
- digital libraries
- training documents
- association rules
- semantic features
- databases
- semantic relations
- hierarchical clustering
- data clustering
- natural language processing
- high dimensional data
- document collections
- query expansion