Categorizing Web Information on Subject with Statistical Language Modeling.
Xindong ZhouTing WangHuiping ZhouHuowang ChenPublished in: WISE (2004)
Keyphrases
- web information
- statistical language modeling
- web data
- website
- information filtering
- web mining
- search engine
- language model
- web pages
- n gram
- language modeling
- web content
- text classification
- text categorization
- naive bayes classification
- information sources
- deep web
- data mining techniques
- co occurrence
- web search
- databases