Revisiting K-Means and Topic Modeling, a Comparison Study to Cluster Arabic Documents.
Mohammad AlhawaratMohamed Osman Ali HegaziPublished in: IEEE Access (2018)
Keyphrases
- topic modeling
- k means
- clustering algorithm
- topic models
- cluster analysis
- latent dirichlet allocation
- text mining
- text classification
- clustering method
- collaborative filtering
- expectation maximization
- arabic documents
- latent variables
- document clustering
- text documents
- information retrieval systems
- information extraction
- data points
- probabilistic model
- prior knowledge
- high dimensional
- information retrieval
- word spotting
- machine learning