Automatic de-identification of data download packages.
Laura BoeschotenRoos VoorvaartRuben van den GoorberghCasper S. KaandorpMartine de VosPublished in: Data Sci. (2021)
Keyphrases
- raw data
- training data
- high quality
- data collection
- complex data
- experimental data
- data analysis
- spatial data
- image data
- synthetic data
- statistical analysis
- data sets
- big data
- application domains
- data mining algorithms
- missing data
- input data
- data mining techniques
- data processing
- probability distribution
- database
- historical data
- genetic algorithm
- data quality
- noisy data
- knowledge base
- multimedia data
- missing values
- fully automatic
- data distribution
- data structure
- high dimensional
- end users
- data points