Automatic de-identification of Data Download Packages.
Laura BoeschotenRoos VoorvaartCasper S. KaandorpRuben van den GoorberghMartine de VosPublished in: CoRR (2021)
Keyphrases
- data sets
- complex data
- data collection
- database
- data analysis
- raw data
- data processing
- high quality
- probability distribution
- application domains
- computer systems
- data sources
- data structure
- big data
- data objects
- noisy data
- data quality
- historical data
- small number
- data mining algorithms
- data points
- clustering algorithm
- knowledge base
- information systems
- information retrieval
- real time