Linked latent Dirichlet allocation in web spam filtering.
István Bíró, Dávid Siklósi, Jácint Szabó, András A. Benczúr
Published in: AIRWeb (2009)
Keyphrases
- spam filtering
- latent dirichlet allocation
- topic models
- topic modeling
- generative model
- text classification
- text mining
- topic discovery
- lda model
- variational bayesian inference
- anti spam
- hierarchical bayesian model
- gibbs sampling
- web documents
- latent topics
- spam detection
- spam filters
- probabilistic topic models
- web pages
- dimensionality reduction
- latent topic models
- web spam
- variational inference
- link analysis
- probabilistic latent semantic analysis
- information retrieval
- information extraction
- knn
- feature space
- support vector
- keywords
- artificial intelligence