Urdu Documents Clustering with Unsupervised and Semi-Supervised Probabilistic Topic Modeling.
Mubashar MustafaFeng ZengHussain GhulamHafiz Muhammad ArslanPublished in: Inf. (2020)
Keyphrases
- topic modeling
- topic models
- unsupervised and semi supervised
- topic extraction
- semi supervised
- probabilistic topic models
- document clustering
- monolingual and cross lingual
- text documents
- text mining
- clustering algorithm
- generative model
- maximum margin
- tag information
- latent dirichlet allocation
- latent topics
- text classification
- k means
- clustering method
- probabilistic model
- probabilistic latent semantic analysis
- unsupervised learning
- collaborative filtering
- document classification
- information retrieval
- latent variables
- sentiment analysis
- spectral clustering
- data points
- natural language
- learning algorithm