Synergistic effects between data corpora properties and machine learning performance in data pipelines.
Roberto BertoliniStephen J. FinchPublished in: Int. J. Data Min. Model. Manag. (2022)
Keyphrases
- data analysis
- data sources
- synthetic data
- high quality
- knowledge discovery
- machine learning
- data points
- data collection
- data processing
- data sets
- complex data
- raw data
- statistical methods
- statistical analysis
- computer systems
- data structure
- training data
- input data
- data management
- small number
- high dimensional data
- background knowledge
- application domains
- artificial intelligence
- databases
- database