Data quality in web archiving.
Marc Spaniol, Dimitar Denev, Arturas Mazeika, Gerhard Weikum, Pierre Senellart
Published in: WICOW (2009)
Keyphrases
- data quality
- website
- quality management
- data cleansing
- data confidentiality
- data cleaning
- data transformation
- web pages
- poor quality
- data warehouse
- quality assessment
- information loss
- web content
- linked data
- class noise
- data analysis
- web mining
- data privacy
- real world
- natural resources
- cell suppression
- database
- privacy guarantees
- privacy preservation
- data collection
- text mining
- databases