Beyond Document Page Classification: Design, Datasets, and Challenges.
Jordy Van Landeghem, Sanket Biswas, Matthew B. Blaschko, Marie-Francine Moens
Published in: WACV (2024)
Keyphrases
- benchmark datasets
- document classification
- UCI repository
- design principles
- pattern recognition
- keywords
- design process
- image classification
- support vector machine (SVM)
- lessons learned
- database
- classification accuracy
- feature vectors
- feature extraction
- case study
- supervised learning
- web documents
- document images
- classification method
- benchmark data sets
- feature selection
- data sets