Unsupervised Machine Learning based Documents Clustering in Urdu.
Atta Ur Rahman, Khairullah Khan, Wahab Khan, Aurangzeb Khan, Bibi Saqia
Published in: EAI Endorsed Trans. Scalable Inf. Syst. (2018)
Keyphrases
- machine learning
- unsupervised learning
- document clustering
- supervised classification
- text clustering
- supervised learning
- automatic text categorization
- data clustering
- text mining
- unsupervised classification
- clustering algorithm
- unsupervised clustering
- unsupervised manner
- topic modeling
- k-means
- topic discovery
- unsupervised feature selection
- information retrieval
- hierarchical clustering
- decision trees
- document collections
- clustering method
- machine learning methods
- information bottleneck
- web documents
- self-organizing maps
- information theoretic
- machine learning algorithms
- keywords
- semi-supervised
- agglomerative clustering
- cluster analysis
- text classification
- information retrieval systems
- natural language processing
- xml documents
- active learning
- completely unsupervised
- cosine similarity
- keyword extraction
- data mining
- document retrieval
- text documents
- relevant documents
- semi-supervised learning
- multi-document summarization
- search engine
- vector space model
- sentiment analysis
- information extraction
- knowledge discovery