CrowdCleaner: A Data Cleaning System Based on Crowdsourcing.
Chen YeHongzhi WangKeli LiQian ChenJianhua ChenJiangduo SongWeidong YuanPublished in: APWeb (2014)
Keyphrases
- data cleaning
- data integration
- text classification
- data quality
- outlier detection
- record linkage
- database
- data warehousing
- fraud detection
- missing values
- data processing
- integrity constraints
- data warehouse
- web usage mining
- case study
- search engine
- cost sensitive
- databases
- detection algorithm
- object oriented
- information extraction
- query language
- active learning
- high dimensional
- decision making
- feature selection
- data mining
- real world