Document Clustering Meets Topic Modeling with Word Embeddings.
Gianni CostaRiccardo OrtalePublished in: SDM (2020)
Keyphrases
- topic modeling
- document clustering
- topic extraction
- text mining
- text documents
- topic models
- text classification
- latent dirichlet allocation
- co occurrence
- keywords
- tf idf
- text analysis
- n gram
- vector space model
- negative matrix factorization
- document collections
- clustering algorithm
- clustering method
- document classification
- k means
- knowledge discovery
- named entities
- vector space
- collaborative filtering
- search engine
- low dimensional
- real world
- cluster analysis
- text categorization
- latent variables
- dimensionality reduction
- information extraction
- data mining