Clustering documents into a web directory for bootstrapping a supervised classification.
Giordano Adami, Paolo Avesani, Diego Sona
Published in: Data Knowl. Eng. (2005)
Keyphrases
- supervised classification
- web directories
- unsupervised clustering
- unsupervised classification
- unsupervised learning
- document clustering
- supervised learning
- vector space model
- web documents
- information retrieval
- document collections
- website
- search engine
- clustering algorithm
- k-means
- keywords
- vector space
- hierarchical structure
- document retrieval
- text documents
- information extraction
- image processing
- relevant documents
- information retrieval systems
- search tools
- data sets
- data mining
- metadata
- web browser
- user queries
- database
- model selection
- co-occurrence