Cleanix: a Parallel Big Data Cleaning System.
Hongzhi Wang, Mingda Li, Yingyi Bu, Jianzhong Li, Hong Gao, Jiacheng Zhang
Published in: SIGMOD Rec. (2015)
Keyphrases
- big data
- data analysis
- cloud computing
- big data analytics
- data management
- data processing
- data intensive
- high volume
- data visualization
- unstructured data
- business intelligence
- vast amounts of data
- massive data
- social media
- parallel processing
- shared memory
- data science
- knowledge discovery
- health informatics
- massively parallel
- huge data
- data analytics
- massive datasets
- commodity hardware
- data mining
- information processing
- expert systems
- data driven decision making