A Comparative Analysis of Latent Variable Models for Web Page Classification.
István Bíró, András A. Benczúr, Jácint Szabó, Ana Gabriela Maguitman
Published in: LA-WEB (2008)
Keyphrases
- web page classification
- latent variable models
- latent variables
- text classification
- web mining
- automatic classification
- real valued
- latent dirichlet allocation
- web pages
- hidden markov models
- feature selection
- text data
- learning problems
- probabilistic model
- latent space
- hidden variables
- bag of words
- data mining
- topic models
- anchor text
- knn
- search engine
- information retrieval