Deriving and comparing deduplication techniques using a model-based classification.
Jürgen KaiserAndré BrinkmannTim SüßDirk MeisterPublished in: EuroSys (2015)
Keyphrases
- classification accuracy
- pattern recognition
- classification process
- object classification
- machine learning
- classification systems
- decision trees
- automatic classification
- decision rules
- classification algorithm
- machine learning methods
- unsupervised learning
- benchmark data sets
- training set
- training data
- support vector
- feature selection
- databases
- roc curve
- pattern classification
- classification rate
- genetic algorithm
- classification method
- cross validation
- benchmark datasets
- support vector machine svm
- model selection
- text classification
- image classification
- support vector machine
- high dimensional
- feature space
- object recognition
- feature extraction