From Papers to Practice: The openclean Open-Source Data Cleaning Library.
Heiko MuellerSonia CasteloMunaf A. QaziJuliana FreirePublished in: Proc. VLDB Endow. (2021)
Keyphrases
- data cleaning
- open source
- data integration
- text classification
- record linkage
- data quality
- outlier detection
- data processing
- database
- data warehousing
- missing values
- web usage mining
- case study
- data warehouse
- integrity constraints
- fraud detection
- database management systems
- user behavior
- website
- machine learning
- real world
- databases