A data traceability method to improve data quality in a big data environment.
Guobao ZhangPublished in: DSC (2020)
Keyphrases
- data quality
- big data
- information loss
- data processing
- data sets
- noisy data
- data analysis
- unstructured data
- data cleaning
- data warehouse
- big data analytics
- vast amounts of data
- privacy guarantees
- data transformation
- digital data
- databases
- cloud computing
- data sources
- database
- data privacy
- missing values
- knowledge discovery
- high dimensional
- raw data
- missing data
- massive data