Improving accuracy of classification models induced from anonymized datasets.
Mark Last, Tamir Tassa, Alexandra Zhmudyak, Erez Shmueli
Published in: Inf. Sci. (2014)
Keyphrases
- classification models
- models built
- training data
- decision trees
- feature selection
- imbalanced data
- feature set
- classification accuracy
- learning models
- privacy preserving
- software quality classification
- benchmark datasets
- UCI machine learning repository
- attribute selection
- decision tree algorithm
- data sets
- database
- information loss
- sensitive information
- software engineering
- social networks