On Extracting Information from Semi-structured Deep Web Documents.
Patricia JiménezRafael CorchueloPublished in: BIS (2015)
Keyphrases
- web documents
- semi structured
- information extraction
- web data
- textual information
- semi structured data
- structured data
- data collections
- web pages
- data extraction
- html documents
- web data sources
- tree structured patterns
- web content
- information integration
- web sources
- keywords
- free text
- information sources
- web search engines
- structured knowledge
- unstructured text
- wrapper generation
- unstructured data
- semistructured documents
- structured documents
- semi automatic
- text mining
- content and structure
- data model
- metadata
- information retrieval
- real world
- data sets