Web Document Classification based on Hyperlinks and Document Semantics.
Yin-Hung KuoMan Hon WongPublished in: PRICAI Workshop on Text and Web Mining (2000)
Keyphrases
- web document classification
- document classification
- web documents
- semantic information
- probabilistic neural network
- text documents
- text classification
- text categorization
- information retrieval
- web pages
- document retrieval
- classification algorithm
- keywords
- term frequency inverse document frequency
- knn
- probabilistic relational models
- document clustering
- logic programming
- text mining
- retrieval systems
- structured data
- document collections
- information retrieval systems
- co occurrence
- search engine