Latent Dirichlet Allocation for Automatic Document Categorization.
István Bíró, Jácint Szabó
Published in: ECML/PKDD (2) (2009)
Keyphrases
- latent dirichlet allocation
- document categorization
- topic models
- topic modeling
- text documents
- text categorization
- text classification
- text mining
- generative model
- document representation
- document classification
- latent semantic indexing
- information extraction
- document clustering
- dimensionality reduction
- data mining
- support vector machine
- similarity measure
- feature extraction
- knowledge base
- web pages
- feature selection