Text mining and probabilistic language modeling for online review spam detection.
Raymond Y. K. LauStephen Shaoyi LiaoRon Chi-Wai KwokKaiquan XuYunqing XiaYuefeng LiPublished in: ACM Trans. Manag. Inf. Syst. (2011)
Keyphrases
- language modeling
- spam detection
- text mining
- probabilistic model
- language model
- information retrieval
- text classification
- retrieval model
- spam filtering
- query expansion
- n gram
- generative model
- information extraction
- natural language processing
- bayesian networks
- text documents
- document clustering
- uncertain data
- fraud detection
- link analysis
- knowledge discovery
- web graph
- machine learning
- web mining
- distance measure
- topic modeling
- feature vectors
- website
- data mining
- language modeling framework