Statistical Distortion: Consequences of Data Cleaning
Tamraparni DasuJi Meng LohPublished in: CoRR (2012)
Keyphrases
- data cleaning
- data integration
- data quality
- record linkage
- text classification
- outlier detection
- data warehousing
- fraud detection
- data processing
- missing values
- database
- information extraction
- data warehouse
- web usage mining
- decision support
- database systems
- information retrieval
- real world
- data sources
- data model
- relational databases
- high dimensional
- case study
- website
- decision making
- search engine