Structural handwritten and machine print classification for sparse content and arbitrary oriented document fragments.
Sukalpa ChandaKatrin FrankeUmapada PalPublished in: SAC (2010)
Keyphrases
- document classification
- web documents
- pattern recognition
- support vector machine
- document content
- classification method
- feature space
- supervised learning
- information retrieval
- document analysis
- training set
- classification accuracy
- automatic classification
- semantic information
- classification algorithm
- structural information
- image classification
- feature vectors
- multimedia documents
- feature extraction
- textual content
- feature selection
- document collections
- support vector machine svm
- text categorization
- text classification
- sparse representation
- high dimensional
- support vector
- decision trees
- structured documents
- relevant content
- metadata
- handwritten documents
- content and structure
- document structure
- information retrieval systems
- document representation
- character recognition
- retrieval systems
- machine learning