Using tree-grammars for training set expansion in page classification.
Stefano BaldiSimone MarinaiGiovanni SodaPublished in: ICDAR (2003)
Keyphrases
- training set
- classification accuracy
- classification algorithm
- training samples
- tree grammars
- supervised learning
- svm classifier
- feature space
- decision trees
- class labels
- website
- training data
- pattern classification
- decision rules
- cross validation
- feature selection
- decision boundary
- support vector machine
- data sets
- training dataset
- classification scheme
- automatic classification
- machine learning
- machine learning algorithms
- support vector machine svm
- nearest neighbor
- pattern recognition
- feature extraction
- test set
- semi supervised learning
- text classification
- image classification
- classification rules
- face images
- classification error
- class distribution
- knn
- preprocessing