Multi-attributes Image Analysis for the Classification of Web Documents Using Unsupervised Technique.
Samuel W. K. ChanPublished in: IDEAL (2005)
Keyphrases
- web documents
- document classification
- image analysis
- pattern recognition
- information extraction
- web pages
- supervised learning
- semi structured
- keywords
- unsupervised learning
- classification algorithm
- web search engines
- machine learning
- automatic classification
- feature space
- textual information
- feature selection
- image classification
- training set
- text categorization
- web content
- vector space model
- similarity measure
- link structure
- html documents