Validating Data and Models in Continuous ML Pipelines.
Mike DrevesGene HuangZhuo PengNeoklis PolyzotisEvan RosenPaul Suganthan G. C.Published in: IEEE Data Eng. Bull. (2021)
Keyphrases
- experimental data
- historical data
- data sets
- data analysis
- raw data
- synthetic data
- big data
- incomplete data
- data collection
- data processing
- complex data
- data sources
- prior knowledge
- data structure
- database
- training data
- learning models
- original data
- image data
- discrete data
- feature selection
- learned models
- accurate models
- data quality
- subject specific
- network structure
- data distribution
- application domains
- data mining algorithms
- sensor data
- small number
- knowledge discovery
- probabilistic model
- feature space
- bayesian networks