Latent Dirichlet allocation in web spam filtering.
István Bíró, Jácint Szabó, András A. Benczúr
Published in: AIRWeb (2008)
Keyphrases
- spam filtering
- latent Dirichlet allocation
- topic models
- topic modeling
- LDA model
- text classification
- topic discovery
- generative model
- text mining
- variational Bayesian inference
- latent topics
- spam filters
- probabilistic topic models
- web documents
- probabilistic latent semantic analysis
- Gibbs sampling
- variational inference
- latent topic models
- web pages
- artificial intelligence
- hierarchical Bayesian model
- word counts
- real world
- spam detection
- information retrieval
- co-occurrence
- visual recognition
- Bayesian networks
- latent variables
- text documents
- probabilistic model
- machine learning