Bleach: A Distributed Stream Data Cleaning System.
Yongchao TianPietro MichiardiMarko VukolicPublished in: BigData Congress (2017)
Keyphrases
- data cleaning
- outlier detection
- data integration
- record linkage
- text classification
- data quality
- database
- data streams
- data warehouse
- data processing
- data warehousing
- databases
- missing values
- website
- web usage mining
- mobile agents
- machine learning
- fraud detection
- decision making
- missing data
- case study
- text mining
- information extraction
- natural language