Optimistic bias in the assessment of high dimensional classifiers with a limited dataset.
Weijie ChenDavid G. BrownPublished in: IJCNN (2011)
Keyphrases
- high dimensional
- feature set
- feature selection
- decision trees
- training data
- high dimensional datasets
- small sample
- dimensionality reduction
- bias variance decomposition
- nearest neighbor
- low dimensional
- training samples
- test set
- support vector
- sparse data
- high dimensionality
- multiple classifiers
- training dataset
- class labels
- machine learning algorithms
- naive bayes
- high dimensional data
- benchmark datasets
- linear classifiers
- fold cross validation
- classification method
- classification algorithm
- noisy data
- classification models
- variable selection
- synthetic datasets
- image classification
- high dimensional spaces
- training set
- feature space
- data points
- semi supervised