Scalable Data Quality for Big Data: The Pythia Framework for Handling Missing Values.
Atoshum Cahsai, Christos Anagnostopoulos, Peter Triantafillou
Published in: Big Data (2015)
Keyphrases
- data quality
- missing values
- big data
- data cleaning
- missing data
- unstructured data
- incomplete data
- data management
- data warehouse
- knowledge discovery
- data processing
- high dimensional data
- business intelligence
- data preparation
- data warehousing
- databases
- cloud computing
- supervised learning
- social media
- database systems
- decision making
- machine learning