Integrated effect of Data Cleaning and Sampling on Decision Tree Learning of Large Data Sets.
Dipak V. PatilRajankumar Sadashivrao BichkarPublished in: Int. J. Comput. (2012)
Keyphrases
- data cleaning
- decision tree learning
- data integration
- decision trees
- data quality
- text classification
- outlier detection
- record linkage
- missing values
- attribute values
- database
- data sets
- data processing
- data warehousing
- constructive induction
- ensemble methods
- web usage mining
- fraud detection
- privacy preserving
- integrity constraints
- databases
- data analysis
- learning algorithm
- meta learning
- inductive learning
- data warehouse
- knn