A knowledge-based approach for duplicate elimination in data cleaning.
Wai Lup Low, Mong-Li Lee, Tok Wang Ling
Published in: Inf. Syst. (2001)
Keyphrases
- data cleaning
- duplicate elimination
- SQL queries
- relational algebra
- space partitioning
- data integration
- data quality
- record linkage
- text classification
- outlier detection
- query optimization
- join algorithms
- data processing
- database
- fraud detection
- data warehousing
- relational databases
- missing values
- data model
- path expressions
- query optimizer
- relational database systems
- query evaluation
- information extraction
- XML queries
- machine learning
- web usage mining
- database systems
- data warehouse
- integrity constraints
- multi dimensional
- feature selection
- data sets