Automatic Ontology-Based Knowledge Extraction from Web Documents.
Harith AlaniSanghee KimDavid E. MillardMark J. WealWendy HallPaul H. LewisNigel ShadboltPublished in: IEEE Intell. Syst. (2003)
Keyphrases
- web documents
- knowledge extraction
- information extraction
- web pages
- document classification
- knowledge discovery
- textual documents
- semi structured
- data mining
- systems engineering
- web content
- web search engines
- web data
- keywords
- textual information
- document representation
- digital libraries
- semi automatic
- vector space model
- html documents
- machine learning
- content similarity
- relational databases
- web search
- domain specific
- topic specific
- databases
- medical databases
- tree structured patterns