"Real-World" De-Identification of High-Dimensional Transactional Health Datasets.
Kenneth A. MoselleStan RobertsonAndriy KovalPublished in: ITCH (2019)
Keyphrases
- high dimensional
- real world
- high dimensional datasets
- synthetic datasets
- data sets
- low dimensional
- wide range
- data mining
- database
- case study
- dimensional data
- high dimensionality
- synthetic data
- benchmark datasets
- nearest neighbor
- uci machine learning repository
- feature space
- feature selection
- class imbalanced
- health information
- neural network
- synthetic and real datasets
- high dimensional spaces
- decision trees
- sparse data
- parameter space
- gene expression data
- outlier detection
- information systems
- data points
- dimensionality reduction