Do We Need Real Data? - Testing and Training Algorithms with Artificial Geolocation Data.
Jan KaiserKai BavendiekSibylle SchuppPublished in: GI-Jahrestagung (2019)
Keyphrases
- data sets
- synthetic data
- data structure
- data mining algorithms
- training data
- data reduction
- raw data
- original data
- data analysis
- data processing
- data collection
- data sources
- test data
- synthetic datasets
- database
- high quality
- data quality
- machine learning
- experimental data
- learning algorithm
- sensor data
- data objects
- noisy data
- statistical analysis
- computer systems
- optimization problems
- probability distribution
- labelled data
- statistical methods
- data distribution
- missing data
- test set
- high dimensional data
- input data
- knowledge discovery
- computational cost