Operationalizing Assurance Cases for Data Scientists: A Showcase of Concepts and Tooling in the Context of Test Data Quality for Machine Learning.
Lisa JöckelMichael KläsJanek GroßPascal GerberMarkus ScholzJonathan EberleMarc TeschnerDaniel SeifertRichard HawkinsJohn MolloyJens OttnadPublished in: CoRR (2023)
Keyphrases
- data quality
- machine learning
- data analysis
- data sets
- database
- high energy physics
- original data
- data cleaning
- quality management
- data transformation
- data processing
- information loss
- data cleansing
- training data
- data privacy
- quality assessment
- data collection
- data warehouse
- data sources
- poor quality
- data preparation
- knowledge discovery
- natural resources
- databases