AlphaClean: Automatic Generation of Data Cleaning Pipelines.
Sanjay Krishnan, Eugene Wu
Published in: CoRR (2019)
Keyphrases
- data cleaning
- data integration
- outlier detection
- record linkage
- data quality
- text classification
- missing values
- data processing
- database
- fraud detection
- data warehousing
- web usage mining
- data warehouse
- information extraction
- data model
- integrity constraints
- text mining
- data sets
- databases
- machine learning
- data management
- linked data
- database systems
- data streams
- missing data