A Probabilistic Method to Predict Classifier Accuracy on Larger Datasets given Small Pilot Data.
Ethan Harvey, Wansu Chen, David M. Kent, Michael C. Hughes
Published in: CoRR (2023)
Keyphrases
- input data
- test data
- synthetic data
- high accuracy
- data sets
- raw data
- computational cost
- prior knowledge
- training samples
- training data
- missing data
- classification method
- small number
- error rate
- database
- noisy data
- support vector machine
- original data
- data points
- uncertain data
- data analysis
- feature selection
- missing values
- training examples
- extracted features
- classification process
- classification trees
- high precision
- clustering method
- probability distribution
- similarity measure
- feature set
- roc curve
- feature space
- classification rate
- bayesian networks
- input vectors
- supervised classifiers