Large scale link based latent Dirichlet allocation for web document classification
István BíróJácint SzabóPublished in: CoRR (2010)
Keyphrases
- latent dirichlet allocation
- web document classification
- topic models
- generative model
- document classification
- text mining
- lda model
- probabilistic neural network
- probabilistic relational models
- knn
- dimensionality reduction
- ranking algorithm
- probabilistic model
- feature selection
- learning algorithm
- text classification
- information retrieval
- bayesian networks