Hierarchical Multidimensional Classification of Web Documents with MultiWebClass.
Francesco Serafino, Gianvito Pio, Michelangelo Ceci, Donato Malerba
Published in: Discovery Science (2015)
Keyphrases
- web documents
- document classification
- information extraction
- web pages
- semi-structured
- automatic classification
- web search engines
- web content
- training set
- supervised learning
- image classification
- keywords
- feature selection
- feature set
- feature space
- classification algorithm
- textual information
- link structure
- topic-specific