A Survey on Region Extractors from Web Documents.
Hassan A. SleimanRafael CorchueloPublished in: IEEE Trans. Knowl. Data Eng. (2013)
Keyphrases
- web documents
- semi structured
- information extraction
- document classification
- web content
- web pages
- keywords
- web search engines
- html documents
- textual information
- link structure
- vector space model
- web data
- topic specific
- search engine
- database
- content similarity
- information retrieval systems
- web search
- data model
- website
- databases