Exploiting information redundancy to wring out structured data from the web.
Lorenzo Blanco, Mirko Bronzi, Valter Crescenzi, Paolo Merialdo, Paolo Papotti
Published in: WWW (2010)
Keyphrases
- structured data
- semi structured data
- information redundancy
- linked data
- structured information
- textual data
- semi structured
- information extraction
- unstructured text
- unstructured data
- web data
- structured and unstructured data
- relational data
- xml documents
- web databases
- semistructured data
- web documents
- unstructured information
- keyword search
- tree structured data
- website
- data sources
- web content
- semantic web
- structured databases
- web mining
- metadata
- database
- web pages
- data sets
- mutual information
- end users
- keyword queries
- web users
- social networks
- knowledge discovery
- natural language processing
- text mining
- image quality