Data Preparation for Duplicate Detection.
Ioannis K. KoumarelasLan JiangFelix NaumannPublished in: ACM J. Data Inf. Qual. (2020)
Keyphrases
- duplicate detection
- data preparation
- data quality
- data cleaning
- knowledge discovery in databases
- preprocessing
- knowledge discovery
- data mining
- web usage mining
- data analysis
- pattern discovery
- feature selection
- machine learning
- knowledge discovery and data mining
- data reduction
- instance selection
- information extraction
- process model
- web mining
- record linkage
- databases
- outlier detection
- information retrieval