Learning Misclassification Costs for Imbalanced Datasets, Application in Gene Expression Data Classification.
Huijuan LuYige XuMinchao YeKe YanQun JinZhigang GaoPublished in: ICIC (1) (2018)
Keyphrases
- cost sensitive learning
- imbalanced datasets
- gene expression data
- class imbalance
- class distribution
- cost sensitive
- supervised learning
- learning algorithm
- misclassification costs
- feature selection
- microarray
- active learning
- high dimensionality
- classification accuracy
- decision trees
- learning process
- image classification
- learning models
- pattern recognition
- machine learning algorithms
- machine learning
- high dimensional
- learning problems
- benchmark datasets
- naive bayes
- training samples
- text classification
- support vector
- bayesian networks
- feature selection algorithms
- feature extraction
- text mining
- training set
- imbalanced data
- feature vectors