Document Clustering vs Topic Models: A Case Study.
Meng Yuan, Pauline Lin, Justin Zobel
Published in: ADCS (2021)
Keyphrases
- document clustering
- topic models
- text documents
- text mining
- topic modeling
- latent Dirichlet allocation
- document representation
- document classification
- text analysis
- non-negative matrix factorization
- clustering algorithm
- document collections
- tf-idf
- news articles
- generative model
- co-occurrence
- clustering method
- text classification
- probabilistic model
- vector space model
- information retrieval
- machine learning
- named entities
- information extraction
- cross-lingual
- training data
- text collections
- data mining
- question answering
- databases
- natural language processing
- active learning
- k-means