An Unsupervised Technique to Extract Information from Semi-structured Web Pages.
Hassan A. SleimanRafael CorchueloPublished in: WISE (2012)
Keyphrases
- semi structured
- web documents
- web data
- web pages
- information extraction
- information sources
- web sources
- wrapper generation
- data extraction
- web content
- free text
- databases
- keywords
- web data sources
- html pages
- data collections
- document collections
- logic programming
- information integration
- raw data
- semi structured data
- web search engines
- web data extraction
- knowledge discovery
- website