A Comprehensive Data Quality Methodology for Web and Structured Data.
Carlo Batini, Federico Cabitza, Cinzia Cappiello, Chiara Francalanci
Published in: ICDIM (2006)
Keyphrases
- structured data
- data quality
- semi-structured data
- linked data
- structured information
- textual data
- unstructured data
- semi-structured
- information extraction
- unstructured information
- unstructured text
- website
- structured and unstructured data
- data warehouse
- web documents
- web data
- data sources
- data cleaning
- web pages
- xml documents
- keyword search
- keyword queries
- web databases
- web content
- structured databases
- database
- page contents
- web mining
- text categorization
- semantic web
- end users
- metadata
- search interface
- big data
- information management
- data mining
- real world