URL-Based Web Page Classification: With n-Gram Language Models.
Tarek Amr AbdallahBeatriz de la IglesiaPublished in: IC3K (Selected Papers) (2014)
Keyphrases
- web page classification
- web pages
- anchor text
- text classification
- n gram
- website
- language modeling
- web search
- web mining
- automatic classification
- keywords
- web documents
- test collection
- query logs
- feature selection
- search engine
- language model
- web search engines
- data mining
- information retrieval
- machine learning
- image classification
- web data
- link analysis
- document representation
- text mining
- search tasks
- web graph