Real-time unsupervised classification of web documents.
Anthony Sigogne, Matthieu Constant
Published in: IMCSIT (2009)
Keyphrases
- web documents
- unsupervised classification
- supervised classification
- semi-structured
- unsupervised learning
- keywords
- information extraction
- web pages
- clustering ensemble
- data clustering
- html documents
- vector space model
- hyperspectral images
- k-means
- probabilistic model
- data streams
- learning algorithm
- remote sensing images