Classification of Web Documents Using Concept Extraction from Ontologies.
Marina LitvakMark LastSlava KisilevichPublished in: AIS-ADM (2007)
Keyphrases
- web documents
- document classification
- information extraction
- semi structured
- keywords
- web search engines
- automatic classification
- web pages
- semantic association
- classification algorithm
- web directories
- domain specific
- natural language processing
- databases
- web content
- image classification
- automatic extraction
- textual information
- feature selection
- machine learning
- unstructured documents
- classify documents
- web information extraction
- focused crawling
- html documents
- document representation
- training set
- domain knowledge
- semantic web