Scalable Iterative Classification for Sanitizing Large-Scale Datasets.
Bo LiYevgeniy VorobeychikMuqun LiBradley A. MalinPublished in: IEEE Trans. Knowl. Data Eng. (2017)
Keyphrases
- association rules
- associative classifiers
- classification rules
- benchmark datasets
- uci repository
- classification accuracy
- machine learning
- web scale
- pattern classification
- decision trees
- database
- image classification
- classification systems
- classification scheme
- feature extraction
- automatic classification
- small scale
- data sets
- uci machine learning repository
- classification method
- million images
- decision rules
- pattern recognition
- text classification
- preprocessing
- supervised learning
- feature vectors
- real life
- hierarchical text classification
- class labels
- high scalability
- real world
- support vector
- training set
- data intensive
- training dataset
- feature set
- classification models
- cost sensitive
- high dimensionality
- training samples
- machine learning methods
- support vector machine svm
- classification algorithm