A greedy feature selection algorithm for Big Data of high dimensionality.
Ioannis TsamardinosGiorgos BorboudakisPavlos KatsogridakisPolyvios PratikakisVassilis ChristophidesPublished in: Mach. Learn. (2019)
Keyphrases
- high dimensionality
- big data
- feature selection
- feature selection algorithms
- cloud computing
- big data analytics
- small sample size
- text categorization
- data processing
- irrelevant features
- social media
- business intelligence
- text classification
- feature set
- data management
- feature space
- gene expression data
- dimensionality reduction
- data analysis
- redundant features
- knowledge discovery
- classification accuracy
- feature ranking
- data warehousing
- selected features
- support vector machine
- knn
- imbalanced datasets
- support vector
- microarray data
- feature subset
- gene selection
- class imbalance
- genetic algorithm
- information processing
- feature extraction
- data mining
- feature vectors
- decision making
- high dimensional
- microarray
- databases