A First-Principles Algebraic Approach to Data Transformations in Data Cleaning: Understanding Provenance from the Ground Up.
Santiago Núñez CorralesLan LiBertram LudäscherPublished in: TaPP (2020)
Keyphrases
- data cleaning
- data quality
- data sets
- database
- data processing
- data from multiple sources
- data integration
- record linkage
- knowledge discovery
- input data
- web data
- databases
- data analysis
- text classification
- web services
- high dimensional data
- natural language
- association rule mining
- outlier detection
- missing values
- search engine