Batchwise Probabilistic Incremental Data Cleaning.
Paulo H. OliveiraDaniel S. KasterCaetano Traina Jr.Ihab F. IlyasPublished in: CoRR (2020)
Keyphrases
- data cleaning
- data integration
- record linkage
- outlier detection
- text classification
- data quality
- database
- data processing
- data warehousing
- fraud detection
- bayesian networks
- missing values
- databases
- data warehouse
- web usage mining
- data sources
- case study
- business intelligence
- database systems
- decision support
- integrity constraints
- high dimensional
- machine learning
- linked data
- text mining
- nearest neighbor
- active learning
- data extraction
- data sets