Development of Hybrid Classification Methodology for Mining Skewed Data Sets - A Case Study of Indian Customs Data.
Anuj KumarVishnuprasad NagadevaraPublished in: AICCSA (2006)
Keyphrases
- data sets
- database
- classification accuracy
- data analysis
- data collection
- data mining techniques
- synthetic data
- knowledge discovery
- training data
- data sources
- original data
- decision trees
- data mining applications
- case study
- input data
- raw data
- data mining tasks
- data mining algorithms
- data mining methods
- high dimensional data
- benchmark data sets
- training samples
- data reduction
- software engineering
- training set
- feature extraction
- feature selection
- interesting patterns
- large scale data sets
- heterogeneous data sets
- fit in main memory
- gene expression data
- missing values
- text classification
- feature set
- data points
- feature vectors
- association rules