Reporting bias when using real data sets to analyze classification performance.
Mohammadmahdi R. YousefiJianping HuaChao SimaEdward R. DoughertyPublished in: Bioinform. (2010)
Keyphrases
- machine learning
- pattern recognition
- feature selection
- feature vectors
- automatic classification
- classification systems
- support vector
- decision trees
- feature extraction
- pattern classification
- classification scheme
- classification accuracy
- machine learning algorithms
- classification process
- supervised classification
- benchmark data sets
- classification method
- object classification
- training samples
- training set
- feature space
- preprocessing
- information retrieval
- data mining
- data sets
- database
- learning vector quantization
- document classification
- business intelligence
- text classification
- support vector machine