ActiveClean: Interactive Data Cleaning For Statistical Modeling.
Sanjay Krishnan, Jiannan Wang, Eugene Wu, Michael J. Franklin, Ken Goldberg
Published in: Proc. VLDB Endow. (2016)
Keyphrases
- statistical modeling
- data cleaning
- data integration
- statistical models
- record linkage
- data quality
- outlier detection
- text classification
- data processing
- data warehouse
- fraud detection
- database
- missing values
- data warehousing
- databases
- user interaction
- integrity constraints
- web usage mining
- text mining
- statistical model
- privacy preserving
- high dimensional data
- natural language processing
- data model
- machine learning