Feature selection for genomic data sets through feature clustering.
Fengbin ZhengXiajiong ShenZhengye FuShanshan ZhengGuangrong LiPublished in: Int. J. Data Min. Bioinform. (2010)
Keyphrases
- data sets
- feature selection
- feature set
- feature selection algorithms
- selecting features
- irrelevant features
- high dimensionality
- redundant features
- discriminative features
- unsupervised learning
- feature subset
- feature importance
- unsupervised feature selection
- k means
- clustering algorithm
- image features
- high dimensional data
- clustering method
- feature subspace
- text categorization
- high dimensional data sets
- validity indices
- machine learning
- data points
- pointwise mutual information
- data clustering
- document clustering
- data pre processing
- feature weighting
- mixed data
- multiple features
- preprocessing step
- microarray data
- cluster analysis
- self organizing maps
- feature space
- feature vectors
- supervised feature selection
- hierarchical clustering
- feature extraction
- support vector
- data streams
- discriminatory power
- training set
- feature ranking
- categorical data
- classification accuracy
- knn
- high dimensional
- cluster structure
- text classification
- input features
- gene expression data
- similarity measure
- sequence data