Impact of Boolean factorization as preprocessing methods for classification of Boolean data.
Radim BelohlávekJan OutrataMartin TrneckaPublished in: Ann. Math. Artif. Intell. (2014)
Keyphrases
- preprocessing
- data sets
- preprocessing steps
- data mining techniques
- training data
- classification systems
- data reduction
- machine learning methods
- benchmark datasets
- database
- high dimensional data
- data mining methods
- training samples
- data sources
- data analysis
- statistical methods
- input data
- data collection
- benchmark data sets
- machine learning algorithms
- real valued
- data structure
- data points
- classification decisions
- classification trees
- decision trees
- feature extraction
- data quality
- spectral clustering
- original data
- missing values
- missing data
- pattern recognition
- feature vectors
- text classification
- classification accuracy
- classification models
- statistical tests
- feature selection
- knowledge discovery
- large scale data sets
- machine learning
- real valued data