Semi-Supervised Data Cleaning with Raha and Baran.
Mohammad Mahdavi, Ziawasch Abedjan
Published in: CIDR (2021)
Keyphrases
- data cleaning
- semi-supervised
- data integration
- duplicate detection
- record linkage
- data quality
- outlier detection
- text classification
- database
- pairwise
- active learning
- data processing
- data warehousing
- missing values
- web usage mining
- information extraction
- data warehouse
- data model
- fraud detection
- case study
- integrity constraints
- multi-class
- query evaluation
- data management
- text mining
- principal component analysis
- natural language
- machine learning
- databases