Modeling Web Documents as Objects for Automatic Web Content Extraction - Object-oriented Web Data Model.
Estella Annoni, Christie I. Ezeife
Published in: ICEIS (1) (2009)
Keyphrases
- web documents
- web pages
- data model
- semi-structured
- object-oriented
- web data
- information extraction
- web search engines
- web content
- website
- keywords
- html documents
- content extraction
- web mining
- digital libraries
- text categorization
- semantic web
- databases
- information retrieval
- vector space model
- document representation
- textual information
- link structure
- database